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PLANT STEROL ACYLTRANSFERASES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

5 This application claims priority to U.S. provisional application Serial No. 60/1 52,493, 

filed August 30, 1999 and herein incorporated by reference in its entirety for all purposes. 

BACKGROUND 

10 Technical Field 

The present invention is directed to plant acyltransferase-like nucleic acid and amino 
acid sequences and constructs, and methods related to their use in altering sterol composition 
and/or content, and oil composition and/or content in host cells and plants. 

15 Related Art 

Through the development of plant genetic engineering techniques, it is now possible to 
produce transgenic varieties of plant species to provide plants which have novel and desirable 
characteristics. For example, it is now possible to genetically engineer plants for tolerance to 
environmental stresses, such as resistance to pathogens and tolerance to herbicides. It is also 

20 possible to improve the nutritional characteristics of the plant, for example to provide 

improved fatty acid, carotenoid, sterol and tocopherol compositions. However, the number of 
useful nucleotide sequences for the engineering of such characteristics is thus far limited. 

There is a need for improved means to obtain or manipulate compositions of sterols 
from biosynthetic or natural plant sources. The ability to increase sterol production or alter the 

25 sterol compositions in plants may provide for novel sources of sterols for use in human and 
animal nutrition. 

Sterol biosynthesis branches from the farnesyl diphosphate intermediate in the 
isoprenoid pathway. Sterol biosynthesis occurs via a mevalonate dependent pathway in 
mammals and higher plants (Goodwm,(1981) Biosynthesis of Isoprenoid Compounds, vol 1 
30 (Porter, J.W. & Spurgeon, S.L., eds) pp.443-480, John Wiley and Sons, New York), while in 
green algae sterol biosynthesis is thought to occur via a mevalonate independent pathway 
(Schwender, etal. (1997) Physiology, Biochemistry, and Molecular Biology of Plant Lipids, 
(Williams, J.P., Khan, M.U., and Lem, N.W., eds) pp. 180-182, Kluwer Academic Publishers, 
Norwell, MA). 



MTC 6718 
PATENT 



fc - 1 

2 MTC 6718 

PATENT 

The solubility characteristics of sterol esters suggests that this is the storage form of 
sterols (Chang, et al., (1997) Annu. Rev. Biochem., 66:613-638). Sterol O-acyltransferase 
enzymes such as acyl CoA:cholesterol acyltransferase (ACAT) and lecithinxholesterol 
acyltransferase (LCAT) catalyze the formation of cholesterol esters, and thus are key to 
5 controlling the intracellular cholesterol storage. In yeast, it has been reported that 

overexpression of LROl, a homolog of human LCAT, and phospholipid:diacylglycerol 
acyltransferase increased lipid synthesis (Oelkers et al., (2000) J. Biol. Chem., 26:15609- 
15612; Dahlqvist et al., (2000) Proc. Natl. Acad. Sci. USA, 97:6487-6492). 

The characterization of various acyltransferase proteins is useful for the further study 
10 of plant sterol synthesis systems and for the development of novel and/or alternative sterol 
sources. Studies of plant mechanisms may provide means to further enhance, control, 
?--= modify, or otherwise alter the sterol composition of plant cells. Furthermore, such alterations 
in sterol content and/or composition may provide a means for obtaining tolerance to stress 
and insect damage. Of particular interest are the nucleic acid sequences of genes encoding 
= 15 proteins which may be useful for applications in genetic engineering. 

SUMMARY OF THE INVENTION 

The present invention is directed to lecithinxholesterol acyltransferase-like 
;.= 20 polypeptides (also referred to herein as LCAT) and acyl CoAxholesterol acyltransferase-like 
polypeptides (also referred to herein as ACAT). In particular the invention is related to 
polynucleotides encoding such sterol:acyltransferases, polypeptides encoded by such 
polynucleotides, and the use of such polynucleotides to alter sterol composition and oil 
production. The polynucleotides of the present invention include those derived from plant 
25 sources. 

One aspect of the invention, therefore, is an isolated nucleic acid sequence encoding a 
plant lecithinxholesterol acyltransferase-like polypeptide, a fragment of a plant 
lecithinxholesterol acyltransferase-like polypeptide, a plant acyl CoAxholesterol 
acyltransferase-like polypeptide or a fragment of a plant acyl CoAxholesterol 
30 acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of 
SEQ ID NO: 2, 4, 6, 8, 10-29, 43-51, 73 or 75. Also provided is an isolated nucleic acid 
sequence consisting of SEQ ID NO: 2, 4, 6, 8, 10-29, 43-51, 73 or 75. 
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Still another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO: 3 or SEQ ID NO: 3 with at least one conservative amino acid 
substitution; SEQ ID NO: 2; an isolated polynucleotide that has at least 70%, 80%, 90%, or 
5 95% sequence identity with SEQ ID NO: 2; an isolated polynucleotide of at least 10 amino 
acids that hybridizes under stringent conditions to SEQ ID NO: 2; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ID NO: 2 and encodes a plant lecithin: cholesterol 
acyltransferase-like polypeptide. 

1 0 Still another aspect provides an isolated nucleic acid sequence consisting essentially 

of a polynucleotide of the formula 5' X-(Ri) n -(R 2 ) n -(R 3 ) I1 -Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, Rj and R 2 are any nucleic acid, n is an integer between 0-3000, and R 2 is 
selected from the group consisting of an isolated polynucleotide encoding a polypeptide of 
SEQ ID NO: 3 or SEQ ID NO: 3 with at least one conservative amino acid substitution; SEQ 

15 ID NO: 2; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence 
identity with SEQ ID NO: 2; an isolated polynucleotide of at least 10 amino acids that 
hybridizes under stringent conditions to SEQ ID NO: 2; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ID NO: 2 and encodes a plant lecithinxholesterol 

20 acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ID NO:5 or SEQ ID NO: 5 with at least one conservative amino acid 
substitution; SEQ ID NO: 4; an isolated polynucleotide that has at least 70%, 80%, 90%, or 

25 95% sequence identity with SEQ ID NO: 4; an isolated polynucleotide of at least 10 amino 
acids that hybridizes under stringent conditions to SEQ ID NO: 4; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ID NO: 4 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

30 Another aspect provides an isolated nucleic acid sequence consisting essentially of a 

polynucleotide of the formula 5' X-CR^-CR^-CRa^-Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, R t and R 2 are any nucleic acid, n is an integer between 0-3000, and R 2 is 
selected from the group consisting of an isolated polynucleotide encoding a polypeptide of 



4 MTC 6718 

PATENT 

SEQ ID NO: 5 or SEQ ID NO: 5 with at least one conservative amino acid substitution; SEQ 
ID NO: 4; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence 
identity with SEQ ID NO: 4; an isolated polynucleotide of at least 10 amino acids that 
hybridizes under stringent conditions to SEQ ID NO: 4; an isolated polynucleotide 
5 complementary to any of the foregoing; and/an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ED NO: 4 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 

10 polypeptide of SEQ ID NO: 7 or SEQ ID NO: 7 with at least one conservative amino acid 
substitution; SEQ ID NO: 6; an isolated polynucleotide that has at least 70%, 80%, 90%, or 
95% sequence identity with SEQ ID NO: 6; an isolated polynucleotide of at least 10 amino 
acids that hybridizes under stringent conditions to SEQ ID NO: 6; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 

15 stringent conditions to SEQ ID NO: 6 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of a 
polynucleotide of the formula 5' X-(R 1 ) n -(R 2 ) n -(R 3 ) n -Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, and R 2 are any nucleic acid, n is an integer between 0-3000, and R 2 is 

20 selected from the group consisting of an isolated polynucleotide encoding a polypeptide of 
SEQ ED NO: 7 or SEQ ED NO: 7 with at least one conservative amino acid substitution; SEQ 
ED NO: 6; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence 
identity with SEQ ED NO: 6; an isolated polynucleotide of at least 10 amino acids that 
hybridizes under stringent conditions to SEQ ED NO: 6; an isolated polynucleotide 

25 complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ED NO: 6 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 

30 polypeptide of SEQ ED NO:9 or SEQ ED NO: 9 with at least one conservative amino acid 
substitution; SEQ ED NO: 8; an isolated polynucleotide that has at least 70%, 80%, 90%, or 
95% sequence identity with SEQ ED NO: 8; an isolated polynucleotide of at least 10 amino 
acids that hybridizes under stringent conditions to SEQ ED NO: 8; an isolated polynucleotide 
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complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ED NO: 8 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of a 
5 polynucleotide of the formula 5' X-(Rj) n -(R^) n -(R 3 ) n -Y 3' where X is a hydrogen, Y is a 

hydrogen or a metal, R, and R 2 are any nucleic acid, n is an integer between 0-3000, and R 2 is 
selected from the group consisting of an isolated polynucleotide encoding a polypeptide of 
SEQ ED NO: 9 or SEQ ED NO: 9 with at least one conservative amino acid substitution; SEQ 
ED NO: 8; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence 

10 identity with SEQ ED NO: 8; an isolated polynucleotide of at least 10 amino acids that 
hybridizes under stringent conditions to SEQ ED NO: 8; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ED NO: 8 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

1 5 Another aspect provides an isolated nucleic acid sequence comprising a 

polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
polypeptide of SEQ ED NO:74 or SEQ ED NO: 74 with at least one conservative amino acid 
substitution; SEQ ED NO: 73; an isolated polynucleotide that has at least 70%, 80%, 90%, or 
95% sequence identity with SEQ ED NO: 73; an isolated polynucleotide of at least 10 amino 

20 acids that hybridizes under stringent conditions to SEQ ED NO: 73; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ED NO: 73 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of a 

25 polynucleotide of the formula 5' X-^VOR^-^VY 3' where X is a hydrogen, Y is a 

hydrogen or a metal, R, and R 2 are any nucleic acid, n is an integer between 0-3000, and R 2 is 
selected from the group consisting of an isolated polynucleotide encoding a polypeptide of 
SEQ ED NO: 74 or SEQ ED NO: 74 with at least one conservative amino acid substitution; 
SEQ ED NO: 73; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% 

30 sequence identity with SEQ ED NO: 73; an isolated polynucleotide of at least 10 amino acids 
that hybridizes under stringent conditions to SEQ ED NO: 73; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
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stringent conditions to SEQ ED NO: 73 and encodes a plant lecithin:cholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of an isolated polynucleotide encoding a 
5 polypeptide of SEQ ED NO: 76 or SEQ ED NG: 76 with at least one conservative amino acid 
substitution; SEQ ED NO: 75; an isolated polynucleotide that has at least 70%, 80%, 90%, or 
95% sequence identity with SEQ ED NO: 75; an isolated polynucleotide of at least 10 amino 
acids that hybridizes under stringent conditions to SEQ ED NO: 75; an isolated polynucleotide 
complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 

10 stringent conditions to SEQ ED NO: 75 and encodes a plant lecithinxholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of a 
polynucleotide of the formula 5' X-(R 1 ) n -(R 2 ) n -(R 3 ) n -Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, R, and R 2 are any nucleic acid, n is an integer between 0-3000, and R 2 is 

15 selected from the group consisting of an isolated polynucleotide encoding a polypeptide of 
SEQ ED NO: 76 or SEQ ED NO: 76 with at least one conservative amino acid substitution; 
SEQ ED NO: 75; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% 
sequence identity with SEQ ED NO: 75; an isolated polynucleotide of at least 10 amino acids 
that hybridizes under stringent conditions to SEQ ED NO: 75; an isolated polynucleotide 

20 complementary to any of the foregoing; and an isolated polynucleotide that hybridizes under 
stringent conditions to SEQ ED NO: 75 and encodes a plant lecithin:cholesterol 
acyltransferase-like polypeptide. 

Another aspect provides an isolated nucleic acid sequence comprising a 
polynucleotide selected from the group consisting of SEQ ED NO: 42 or a degenerate variant 

25 thereof; an isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence identity 
with SEQ ED NO: 42; an isolated polynucleotide of at least 10 amino acids that hybridizes 
under stringent conditions to SEQ ED NO: 42; an isolated polynucleotide complementary to 
any of the foregoing; and an isolated polynucleotide that hybridizes under stringent 
conditions to SEQ ED NO: 42 and encodes an acyl CoAxholesterol acyltransferase-like 

30 polypeptide. 

Another aspect provides an isolated nucleic acid sequence consisting essentially of a 
polynucleotide of the formula 5' X-(R 1 ) n -(R 2 ) n -(R 3 ) n -Y 3' where X is a hydrogen, Y is a 
hydrogen or a metal, R! and R 2 are any nucleic acid, n is an integer between 0-3000, and R 2 is 
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selected from the group consisting of SEQ ID NO: 42 or a degenerate variant thereof; an 
isolated polynucleotide that has at least 70%, 80%, 90%, or 95% sequence identity with SEQ 
ID NO: 42; an isolated polynucleotide of at least 10 amino acids that hybridizes under 
stringent conditions to SEQ ID NO: 42; an isolated polynucleotide complementary to any of 
the foregoing; and an isolated polynucleotide that hybridizes under stringent conditions to 
SEQ ID NO: 42 and encodes a acyl CoA:cholesterol acyltransferase-like polypeptide. 

Also provided is a recombinant nucleic acid construct comprising a regulatory 
sequence operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide and/or an acyl CoAxholesterol acyltransferase-like polypeptide. In one 
embodiment, the sterol acyl transferases are plant sterol acyl transferases. In another 
embodiment, the recombinant nucleic acid constructs further comprises a termination 
sequence. The regulatory sequence can be a constitutive promoter, an inducible promoter, a 
developmentally regulated promoter, a tissue specific promoter, an organelle specific 
promoter, a seed specific promoter or a combination of any of the foregoing. Also provided 
is a plant containing this recombinant nucleic acid construct and the seed and progeny from 
such a plant. This recombinant nucleic acid construct can also be introduced into a suitable 
host cell to provide yet another aspect of the invention. If the host cell is a plant host cell, the 
cell can be used to generate a plant to provide another aspect of the invention. Further 
aspects include seed and progeny from such a plant. 

Yet another aspect is a purified polypeptide comprising, SEQ ED NO: 3, SEQ ED NO: 
5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ED NO: 74, SEQ ED NO: 76, or any of the preceding 
sequences with at least one conservative amino acid substitution. 

Still another aspect provides a purified immunogenic polypeptide comprising at least 
10 consecutive amino acids from an amino acid sequence selected from the group consisting 
of SEQ ID NO: 3, 5, 7, 9, 74, 76 and any of the preceding sequences containing at least one 
conservative amino acid substitution. Also provided are antibodies, either polyclonal or 
monoclonal, that specifically bind the preceding immunogenic polypeptides. 

One aspect provides a method for producing a lecithinxholesterol acyltransferase-like 
polypeptide or an acyl CoAxholesterol acyltransferase-like polypeptide comprising culturing 
a host cell containing any recombinant nucleic acid construct of the present invention voider 
condition permitting expression of said lecithinxholesterol acyltransferase-like polypeptide 
or acyl CoAxholesterol acyltransferase-like polypeptide. 
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Another aspect provides a method for modifying the sterol content of a host cell, 
comprising transforming a host cell with a recombinant construct containing a regulatory 
sequence operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide and culturing said host cell under conditions wherein said host cell expresses 
5 a lecithinxholesterol acyltransferase-like polypeptide such that said host cell has a modified 
sterol composition as compared to host cells without the recombinant construct. 

An additional aspect is a method for modifying the sterol content of a host cell 
comprising transforming a host cell with a recombinant construct containing a regulatory 
sequence operably linked to a polynucleotide encoding an acyl CoAxholesterol 
1 0 acyltransferase-like polypeptide and culturing said host cell under conditions wherein said 
host cell expresses an acyl CoAxholesterol acyltransferase-like polypeptide such that said 
host cell has a modified sterol composition as compared to host cells without the recombinant 
construct. 

A further aspect is a plant comprising a recombinant construct containing a regulatory 
1 5 sequence operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in modified sterol 
composition of said plant as compared to the same plant without said recombinant construct. 

Another aspect provides a plant comprising a recombinant construct containing a 
regulatory sequence operably linked to a polynucleotide encoding an acyl CoAxholesterol 
20 acyltransferase-like polypeptide wherein expression of said recombinant construct results in 
modified sterol composition of said plant as compared to the same plant without said 
recombinant construct. 

In a further aspect is provided an oil obtained from any of the plants or host cells of 
the present invention. 

25 In still another aspect is provided a method for producing an oil with a modified sterol 

composition comprising providing any of the plants or host cells of the present invention and 
extracting oil from the plant by any known method. Also provided is an oil produced by the 
preceding method. 

Still another aspect provides a method for altering oil production by a host cell 
30 comprising, transforming a host cell with a recombinant construct containing a regulatory 
sequence operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide and culturing the host cell under conditions wherein the host cell expresses a 



9 MTC 6718 

PATENT 

lecithin:cholesterol acyltransferase-like polypeptide such that the host cell has an altered oil 
production as compared to host cells without the recombinant construct. 

Another aspect provides a method for altering oil production by a host cell 
comprising, transforming a host cell with a recombinant construct containing a regulatory 
5 sequence operably linked to a polynucleotide encoding an acyl CoAxholesterol 

acyltransferase-like polypeptide and culturing the host cell under conditions wherein the host 
cell expresses an acyl CoAxholesterol acyltransferase-like polypeptide such that the host cell 
has an altered oil production as compared to host cells without the recombinant construct. 

Also provided is a plant comprising a recombinant construct containing a regulatory 
10 sequence operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in an altered 
production of oil by said plant as compared to the same plant without said recombinant 
construct. 

In a further aspect is provided a plant comprising a recombinant construct containing 
15 a regulatory sequence operably linked to a polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide wherein expression of said recombinant construct results in 
an altered production of oil by said plant as compared to the same plant without said 
recombinant construct. 

Additional aspects provide a food, food ingredient or food product comprising any oil, 
" 20 plant or host cell of the present invention; a nutritional or dietary supplement comprising any 
oil, plant or host cell of the present invention; and a pharmaceutical composition comprising 
any oil, plant or host cell of the present invention along with a suitable diluent, carrier or 
excipient. 

Additional aspects will be apparent from the descriptions and examples that follow. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 



These and other features, aspects, and advantages of the present invention will 
become better understood with regard to the following description, appended claims and 
30 accompanying figures where: 

Figure 1 shows an alignment of yeast, human and rat lecithinxholesterol 
acyltransferase protein sequences with Arabidopsis LCAT1, LCAT2, LCAT3, and LCAT4 
deduced amino acid sequences. 
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Figure 2 shows the results of NMR sterol ester analysis on T2 seed from plant 
expressing LCAT4 under the control of the napin promoter (pCGN9998). 

Figure 3 shows the results of HPLC/MS sterol analysis on oil extracted from T2 seed 
from control lines (pCGN8640) and lines expressing LCAT3 (pCGN9968) under the control 
5 of the napin promoter. *■ 

Figure 4 shows the results of HPLC/MS sterol analysis on oil extracted from T2 seed 
from control lines (pCGN8640), and plant line expressing LCAT1 (pCGN9962), LCAT2 
(pCGN9983), LCAT3 (pCGN9968), and LCAT4 (pCGN9998) under the control of the napin 
promoter. Additionally, data from 3 lines expressing LCAT4 under the control of the 35S 
1 0 promoter (pCGN9996) are shown. 

Figure 5 shows the results of Nir analysis of the oil content of T2 seed from control 
lines (pCGN8640), and plant lines expressing LCAT1 (pCGN9962), LCAT2 (pCGN9983), 
and LCAT3 (pCGN9968) under the control of the napin promoter. Additionally, data from 
16 lines expressing LCAT2 under the control of the 35S promoter (pCGN9981) are shown. 

15 

DETAILED DESCRIPTION 



The following detailed description is provided to aid those skilled in the art in 
practicing the present invention. Even so, this detailed description should not be construed to 
_ 20 unduly limit the present invention as modifications and variations in the embodiments 

discussed herein can be made by those of ordinary skill in the art without departing from the 
spirit or scope of the present inventive discovery. 

All publications, patents, patent applications and other references cited in this 
application are herein incorporated by reference in their entirety as if each individual 
25 publication, patent, patent application or other reference were specifically and individually 
indicated to be incorporated by reference. 

The present invention relates to lecithinxholesterol acyltransferase, particularly the 
isolated nucleic acid sequences encoding lecithin: cholesterol-like polypeptides (LCAT) from 
plant sources and acyl CoAxholesterohacyltransferase, particularly the isolated nucleic acid 
30 sequences encoding acyl CoAxholesterol acyltransferase-like polypeptides (ACAT) from 
plant sources. Lecithinxholesterol acyltransferase-like as used herein includes any nucleic 
acid sequence encoding an amino acid sequence from a plant source, such as a protein, 
polypeptide or peptide, obtainable from a cell source, which demonstrates the ability to utilize 
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lecithin (phosphatidyl choline) as an acyl donor for acylation of sterols or glycerides to form 
esters under enzyme reactive conditions along with such proteins polypeptides and peptides. 
Acyl CoAxholesterol acyltransferase-like as used herein includes any nucleic acid sequence 
encoding an amino acid sequence from a plant source, such as a protein, polypeptide or 
5 peptide, obtainable from a cell source, which demonstrates the ability to utilize acyl CoA as 
an acyl donor for acylation of sterols or glycerides to form esters under enzyme reactive 
conditions along with such proteins polypeptides and peptides. By "enzyme reactive 
conditions" is meant that any necessary conditions are available in an environment (i.e., such 
factors as temperature, pH, lack of inhibiting substances) which will permit the enzyme to 
10 function. 

The term "sterol" as applied to plants refers to any chiral tetracyclic isopentenoid 
which may be formed by cyclization of squalene oxide through the transition state possessing 
stereochemistry similar to the trans-syn-trans-anti-trans-anti configuration, for example, 
protosteroid cation, and which retains a polar group at C-3 (hydroxyl or keto), an all-trans- 
15 anti stereochemistry in the ring system, and a side-chain 20R-configuration (Parker, et al. 
(1992) In Nes, et al, Eds., Regulation of Isopentenoid Metabolism, ACS Symposium Series 
No. 497, p. 110; American Chemical Society, Washington, D.C.). 

Sterols may or may not contain a C-5-C-6 double bond, as this is a feature introduced 
late in the biosynthetic pathway. Sterols contain a C 8 -Ci 0 side chain at the C- 17 position. 
20 The term "phytosterol," which applies to sterols found uniquely in plants, refers to a 

sterol containing a C-5, and in some cases a C-22, double bond. Phytosterols are further 
characterized by alkylation of the C-17 side-chain with a methyl or ethyl substituent at the C- 
24 position. Major phytosterols include, but are not limited to, sitosterol, stigmasterol, 
campesterol, brassicasterol, etc. Cholesterol, which lacks a C-24 methyl or ethyl side-chain, 
25 is found in plants, but is not unique thereto, and is not a "phytosterol." 

"Phytostanols" are saturated forms of phytosterols wherein the C-5 and, when present, 
C-22 double bond(s) is (are) reduced, and include, but are not limited to, sitostanol, 
campestanol, and 22-dihydrobrassicastanol. 

"Sterol esters" are further characterized by the presence of a fatty acid or phenolic 
30 acid moiety rather than a hydroxyl group at the C-3 position. 

The term "sterol" includes sterols, phytosterols, phytosterol esters, phytostanols, and 
phytostanol esters. 
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The term "sterol compounds" includes sterols, phyotsterols, phytosterol esters, 
phytostanols, and phytostanol esters. 

The term "phytosterol compound" refers to at least one phytosterol, at least one 
phytosterol ester, or a mixture thereof. 
5 The term "phytostanol compound" refers to at least one phytostanol, at least one 

phytostanol ester, or a mixture thereof. 

The term "glyceride" refers to a fatty acid ester of glycerol and includes mono-, di-, 
and tri- acylglycerols. 

As used herein, "recombinant construct" is defined either by its method of production 
10 or its structure. In reference to its method of production, e.g., a product made by a process, 
the process is use of recombinant nucleic acid techniques, e.g., involving human intervention 
in the nucleotide sequence, typically selection or production. Alternatively, in terms of 
structure, it can be a sequence comprising fusion of two or more nucleic acid sequences 
which are not naturally contiguous or operatively linked to each other 
15 As used herein, "regulatory sequence" means a sequence of DNA concerned with 

controlling expression of a gene; e.g. promoters, operators and attenuators. A "heterologous 
regulatory sequence" is one which differs from the regulatory sequence naturally associated 
with a gene. 

As used herein, "polynucleotide" and "oligonucleotide" are used interchangeably 
20 and mean a polymer of at least two nucleotides joined together by a phosphodiester bond and 
may consist of either ribonucleotides or deoxynucleotides. 

As used herein, " sequence" means the linear order in which monomers appear in a 
polymer, for example, the order of amino acids in a polypeptide or the order of nucleotides in 
a polynucleotide. 

25 As used herein, " polypeptide" , "peptide" , and " protein" are used interchangeably 

and mean a compound that consist of two or more amino acids that are linked by means of 
peptide bonds. 

As used herein, the terms "complementary" or "complementarity" refer to the pairing 
of bases, purines and pyrimidines, that associate through hydrogen bonding in double 
30 stranded nucleic acids. For example, the following base pairs are complementary: guanine 
and cytosine; adenine and thymine; and adenine and uracil. The terms, as used herein, 
include complete and partial complementarity. 
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Isolated proteins, Polypeptides and Polynucleotides 

A first aspect of the present invention relates to isolated LCAT polynucleotides. The 
polynucleotide sequences of the present invention include isolated polynucleotides that 
encode the polypeptides of the invention having a deduced amino acid sequence selected 
from the group of sequences set forth in the Sequence Listing and to other polynucleotide 
sequences closely related to such sequences and variants thereof. 

The invention provides a polynucleotide sequence identical over its entire length to 
each coding sequence as set forth in the Sequence Listing. The invention also provides the 
coding sequence for the mature polypeptide or a fragment thereof, as well as the coding 
sequence for the mature polypeptide or a fragment thereof in a reading frame with other 
coding sequences, such as those encoding a leader or secretory sequence, a pre-, pro-, or 
prepro- protein sequence. The polynucleotide can also include non-coding sequences, 
including for example, but not limited to, non-coding 5' and 3' sequences, such as the 
transcribed, untranslated sequences, termination signals, ribosome binding sites, sequences 
that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that 
encodes additional amino acids. For example, a marker sequence can be included to facilitate 
the purification of the fused polypeptide. Polynucleotides of the present invention also 
include polynucleotides comprising a structural gene and the naturally associated sequences 
that control gene expression. 

The invention also includes polynucleotides of the formula: 
X-CR^-O^MRsVY 

wherein, at the 5' end, X is hydrogen, and at the 3' end, Y is hydrogen or a metal, R t and R 3 
are any nucleic acid residue, n is an integer between 0 and 3000, preferably between 1 and 
1000 and R 2 is a nucleic acid sequence of the invention, particularly a nucleic acid sequence 
selected from the group set forth in the Sequence Listing and preferably SEQ ID NOs: 2, 4, 6, 
8, 10-29, 33, 42-5 1, 73 and 75. In the formula, R 2 is oriented so that its 5' end residue is at 
the left, bound to R l5 and its 3' end residue is at the right, bound to R 3 . Any stretch of nucleic 
acid residues denoted by either R group, where R is greater than 1, may be either a 
heteropolymer or a homopolymer, preferably a heteropolymer. 

The invention also relates to variants of the polynucleotides described herein that 
encode for variants of the polypeptides of the invention. Variants that are fragments of the 
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polynucleotides of the invention can be used to synthesize full-length polynucleotides of the 
invention. Preferred embodiments are polynucleotides encoding polypeptide variants 
wherein 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a polypeptide sequence of the 
invention are substituted, added or deleted, in any combination. Particularly preferred are 

5 substitutions, additions, and deletions that are silent such that they do not alter the properties 
or activities of the polynucleotide or polypeptide. 

Further preferred embodiments of the invention that are at least 50%, 60%, or 70% 
identical over their entire length to a polynucleotide encoding a polypeptide of the invention, 
and polynucleotides that are complementary to such polynucleotides. More preferable are 

10 polynucleotides that comprise a region that is at least 80% identical over its entire length to a 
polynucleotide encoding a polypeptide of the invention and polynucleotides that are 
complementary thereto. In this regard, polynucleotides at least 90% identical over their entire 
length are particularly preferred, those at least 95% identical are especially preferred. 
Further, those with at least 97% identity are highly preferred and those with at least 98% and 

1 5 99% identity are particularly highly preferred, with those at least 99% being the most highly 
preferred. 

Preferred embodiments are polynucleotides that encode polypeptides that retain 
substantially the same biological function or activity as determined by the methods described 
herein as the mature polypeptides encoded by the polynucleotides set forth in the Sequence 
20 Listing. 

The invention further relates to polynucleotides that hybridize to the above-described 
sequences. In particular, the invention relates to polynucleotides that hybridize under 
stringent conditions to the above-described polynucleotides. An example of stringent 
hybridization conditions is overnight incubation at 42°C in a solution comprising 50% 

25 formamide, 5x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 
7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 micrograms/milliliter denatured, 
sheared salmon sperm DNA, followed by washing the hybridization support in 0.1 x SSC at 
approximately 65°C. Also included are polynucleotides that hybridize under a wash 
stringency of 0.1X SSC or 0.1X SSPE (at 50°C. Other hybridization and wash conditions are 

30 well known and are exemplified in Sambrook, et al, Molecular Cloning: A Laboratory 
Manual, Second Edition, Cold Spring Harbor, NY (1989), particularly Chapter 11. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
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complete gene for a polynucleotide sequence set for in the Sequence Listing under stringent 
hybridization conditions with a probe having the sequence of said polynucleotide sequence or 
a fragment thereof; and isolating said polynucleotide sequence. Methods for screening 
libraries are well known in the art and can be found for example in Sambrook, et al., 
5 Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, NY (1989), 
particularly Chapter 8 and Ausubel et al., Short Protocols in Molecular Biology, 3 rd ed, 
Wiley and Sons, 1995, chapter 6. Nucleic acid sequences useful for obtaining such a 
polynucleotide include, for example, probes and primers as described herein and in particular 
SEQ ID NO: 2, 4, 6, 8, 10-29, 33, 42-51, 73 and 75. These sequences are particularly useful 

10 in screening libraries obtained from Arabidopsis, soybean and corn for sequences encoding 
lecithinrcholesterol acyltransferase and lecithinrcholesterol acyltransferase-like polypeptides 
and for screening libraries for sequences encoding acyl Co A: cholesterol acyl transferase and 
acyl Co A: cholesterol acyl transferase-like polypeptides. 

As discussed herein regarding polynucleotide assays of the invention, for example, 

1 5 polynucleotides of the invention can be used as a hybridization probe for RNA, cDNA, or 

genomic DNA to isolate full length cDNAs or genomic clones encoding a polypeptide and to 
isolate cDNA or genomic clones of other genes that have a high sequence similarity to a 
polynucleotide set forth in the Sequence Listing and in particular SEQ ID NO: 2, 4, 6, 8, 10- 
29, 33, 42-51, 73 and 75. Such probes will generally comprise at least 15 bases. Preferably 

20 such probes will have at least 30 bases and can have at least 50 bases. Particularly preferred 
probes will have between 30 bases and 50 bases, inclusive. 

The coding region of each gene that comprises or is comprised by a polynucleotide 
sequence set forth in the Sequence Listing may be isolated by screening using a DNA 
sequence provided in the Sequence Listing to synthesize an oligonucleotide probe. A labeled 

25 oligonucleotide having a sequence complementary to that of a gene of the invention is then 
used to screen a library of cDNA, genomic DNA or mRNA to identify members of the library 
which hybridize to the probe. For example, synthetic oligonucleotides are prepared which 
correspond to the LCAT EST sequences. The oligonucleotides are used as primers in 
polymerase chain reaction (PCR) techniques to obtain 5' and 3' terminal sequence of LCAT 

30 genes. Alternatively, where oligonucleotides of low degeneracy can be prepared from 
particular LCAT peptides, such probes may be used directly to screen gene libraries for 
LCAT gene sequences. In particular, screening of cDNA libraries in phage vectors is useful 
in such methods due to lower levels of background hybridization. 
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Typically, a LCAT sequence obtainable from the use of nucleic acid probes will show 
60-70% sequence identity between the target LCAT sequence and the encoding sequence 
used as a probe. However, lengthy sequences with as little as 50-60% sequence identity may 
also be obtained. The nucleic acid probes may be a lengthy fragment of the nucleic acid 

5 sequence, or may also be a shorter, oligonucleotide probe. When longer nucleic acid 
fragments are employed as probes (greater than about 100 bp), one may screen at lower 
stringencies in order to obtain sequences from the target sample which have 20-50% 
deviation (i.e., 50-80% sequence homology) from the sequences used as probe. 
Oligonucleotide probes can be considerably shorter than the entire nucleic acid sequence 

10 encoding an LCAT enzyme, but should be at least about 10, preferably at least about 15, and 
more preferably at least about 20 nucleotides. A higher degree of sequence identity is desired 
when shorter regions are used as opposed to longer regions. It may thus be desirable to 
identify regions of highly conserved amino acid sequence to design oligonucleotide probes 
for detecting and recovering other related LCAT genes. Shorter probes are often particularly 

1 5 useful for polymerase chain reactions (PCR), especially when highly conserved sequences 
can be identified. (See, Gould, et al, PNAS USA (1989) 5(5:1934-1938.). 

Another aspect of the present invention relates to LCAT polypeptides. Such 
polypeptides include isolated polypeptides set forth in the Sequence Listing, as well as 
polypeptides and fragments thereof, particularly those polypeptides which exhibit LCAT 

20 activity and also those polypeptides which have at least 50%, 60% or 70% identity, preferably 
at least 80% identity, more preferably at least 90% identity, and most preferably at least 95% 
identity to a polypeptide sequence selected from the group of sequences set forth in the 
Sequence Listing, and also include portions of such polypeptides, wherein such portion of the 
polypeptide preferably includes at least 30 amino acids and more preferably includes at least 

25 50 amino acids. 

"Identity", as is well understood in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as determined by the match between strings of such 

30 sequences. "Identity" can be readily calculated by known methods including, but not limited 
to, those described in Computational Molecular Biology, Lesk, A.M., ed., Oxford University 
Press, New York (1988); Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Parti, Griffin, 
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A.M. and Griffin, H.G., eds., Humana Press, New Jersey (1994); Sequence Analysis in 
Molecular Biology, von Heinje, G., Academic Press (1987); Sequence Analysis Primer, 
Gribskov, M. and Devereux, J., eds., Stockton Press, New York (1991); and Carillo, H., and 
Lipman, D., SIAM J Applied Math, 48:1073 (1988). Methods to determine identity are 
5 designed to give the largest match between the sequences tested. Moreover, methods to 
determine identity are codified in publicly available programs. Computer programs which 
can be used to determine identity between two sequences include, but are not limited to, GCG 
(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); suite of five BLAST 
programs, three designed for nucleotide sequences queries (BLASTN, BLASTX, and 
1 0 TBLASTX) and two designed for protein sequence queries (BLASTP and TBLASTN) 
(Coulson, Trends in Biotechnology, 12: 76-80 (1994); Birren, et al., Genome Analysis, I: 
543-559 (1997)). The BLAST X program is publicly available from NCBI and other sources 
{BLAST Manual, Altschul, S., et al, NCBI NLM NIH, Bethesda, MD 20894; Altschul, S., et 
al, J. Mol. Biol, 215:403-410 (1990)). The well known Smith Waterman algorithm can also 
15 be used to determine identity. 

Parameters for polypeptide sequence comparison typically include the following: 
Algorithm: Needleman and Wunsch, J. Mol Biol. 48:443-453 (1970) 
Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl. Acad. 
Sci USA 89:10915-10919 (1992) 
20 Gap Penalty: 12 

Gap Length Penalty: 4 

A program which can be used with these parameters is publicly available as the "gap" 
program from Genetics Computer Group, Madison Wisconsin. The above parameters along 
with no penalty for end gap are the default parameters for peptide comparisons. 
25 Parameters for polynucleotide sequence comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970) 

Comparison matrix: matches = +10; mismatches = 0 

Gap Penalty: 50 

Gap Length Penalty: 3 

30 A program which can be used with these parameters is publicly available as the "gap" 

program from Genetics Computer Group, Madison Wisconsin. The above parameters are the 
default parameters for nucleic acid comparisons. 

The invention also includes polypeptides of the formula: 
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wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is hydrogen 
or a metal, R, and R 3 are any amino acid residue, n is an integer between 0 and 1000, and R 2 
is an amino acid sequence of the invention, particularly an amino acid sequence selected from 
5 the group set forth in the Sequence Listing and preferably SEQ ID NOs: 3, 5, 7, 9, 74 and 76. 
In the formula, R 2 is oriented so that its amino terminal residue is at the left, bound to R l5 and 
its carboxy terminal residue is at the right, bound to R 3 . Any stretch of amino acid residues 
denoted by either R group, where R is greater than 1, may be either a heteropolymer or a 
homopolyrner, preferably a heteropolymer. 
1 0 Polypeptides of the present invention include isolated polypeptides encoded by a 

polynucleotide comprising a sequence selected from the group of a sequence contained in 
SEQ ID NOs: 2, 4, 6, 8, 73 and 75. 

The polypeptides of the present invention can be mature protein or can be part of a 
fusion protein. 

1 5 Fragments and variants of the polypeptides are also considered to be a part of the 

invention. A fragment is a variant polypeptide which has an amino acid sequence that is 
entirely the same as part but not all of the amino acid sequence of the previously described 
polypeptides. The fragments can be "free-standing" or comprised within a larger polypeptide 
of which the fragment forms a part or a region, most preferably as a single continuous region. 

20 Preferred fragments are biologically active fragments which are those fragments that mediate 
activities of the polypeptides of the invention, including those with similar activity or 
improved activity or with a decreased activity. Also included are those polypeptides and 
polypeptide fragments that are antigenic or immunogenic in an animal, particularly a human 
and antibodies, either polyclonal or monoclonal that specifically bind the antigenic fragments. 

25 In one preferred embodiment, such antigenic or immunogenic fragments comprise at least 1 0 
consecutive amino acids from the amino acid sequences disclosed herein or such sequences 
with at least one conservative amino acid substitution. In additional embodiments, such 
antigenic or immunogenic fragments comprise at least 15, at least 25, at least 50 or at least 
100 consecutive amino acids from the amino acid sequences disclosed herein or such 

30 sequences with at least one conservative amino acid substitution. Methods for the production 
of antibodies from polypeptides and polypeptides conjugated to carrier molecules are well 
known in the art and can be found for example in Ausubel et al., Short Protocols in 
Molecular Biology, 3 rd ed., Wiley & Sons, 1995, particularly chapter 11. 
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Variants of the polypeptide also include polypeptides that vary from the sequences set 
forth in the Sequence Listing by conservative amino acid substitutions, substitution of a 
residue by another with like characteristics. Those of ordinary skill in the art are aware that 
modifications in the amino acid sequence of a peptide, polypeptide, or protein can result in 
equivalent, or possibly improved, second generation peptides, etc., that display equivalent or 
superior functional characteristics when compared to the original amino acid sequence. The 
present invention accordingly encompasses such modified amino acid sequences. Alterations 
can include amino acid insertions, deletions, substitutions, truncations, fusions, shuffling of 
subunit sequences, and the like, provided that the peptide sequences produced by such 
modifications have substantially the same functional properties as the naturally occurring 
counterpart sequences disclosed herein. 

One factor that can be considered in making such changes is the hydropathic index of 
amino acids. The importance of the hydropathic amino acid index in conferring interactive 
biological function on a protein has been discussed by Kyte and Doolittle ( J. Mol. Biol., 157: 
105-132, 1982). It is accepted that the relative hydropathic character of amino acids 
contributes to the secondary structure of the resultant protein. This, in turn, affects the 
interaction of the protein with molecules such as enzymes, substrates, receptors, DNA, 
antibodies, antigens, etc. 

Based on its hydrophobicity and charge characteristics, each amino acid has been 
assigned a hydropathic index as follows: isoleucine (+4.5); valine (+4.2); leucine (+3.8); 
phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (- 
0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine 
(-3.2); glutamate/glutamine/aspartate/asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

As is known in the art, certain amino acids in a peptide or protein can be substituted 
for other amino acids having a similar hydropathic index or score and produce a resultant 
peptide or protein having similar biological activity, i.e., which still retains biological 
functionality. In making such changes, it is preferable that amino acids having hydropathic 
indices within ±2 are substituted for one another. More preferred substitutions are those 
wherein the amino acids have hydropathic indices within ±1. Most preferred substitutions are 
those wherein the amino acids have hydropathic indices within ±0.5. 

Like amino acids can also be substituted on the basis of hydrophilicity. U.S. Patent 
No. 4,554,101 discloses that the greatest local average hydrophilicity of a protein, as 
governed by the hydrophilicity of its adjacent amino acids, correlates with a biological 
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property of the protein. The following hydrophilicity values have been assigned to amino 
acids: arginine/lysine (+3.0); aspartate/glutamate (+3.0 ±1); serine (+0.3); 
asparagine/glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ±1); alanine/histidine 
(-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine/isoleucine (-1.8); tyrosine (- 
5 2.3); phenylalanine (-2.5); and tryptophan (-£.4). Thus, one amino acid in a peptide, 
polypeptide, or protein can be substituted by another amino acid having a similar 
hydrophilicity score and still produce a resultant protein having similar biological activity, 
i.e., still retaining correct biological function. In making such changes, amino acids having 
hydropathic indices within ±2 are preferably substituted for one another, those within ±1 are 

1 0 more preferred, and those within ±0.5 are most preferred. 

As outlined above, amino acid substitutions in the peptides of the present invention 
can be based on the relative similarity of the amino acid side-chain substituents, for example, 
their hydrophobicity, hydrophilicity, charge, size, etc. Exemplary substitutions that take 
various of the foregoing characteristics into consideration in order to produce conservative 

15 amino acid changes resulting in silent changes within the present peptides, etc., can be 
selected from other members of the class to which the naturally occurring amino acid 
belongs. Amino acids can be divided into the following four groups: (1) acidic amino acids; 
(2) basic amino acids; (3) neutral polar amino acids; and (4) neutral non-polar amino acids. 
Representative amino acids within these various groups include, but are not limited to: (1) 

20 acidic (negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic 
(positively charged) amino acids such as arginine, histidine, and lysine; (3) neutral polar 
amino acids such as glycine, serine, threonine, cysteine, cystine, tyrosine, asparagine, and 
glutamine; and (4) neutral non-polar amino acids such as alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan, and methionine. It should be noted that changes which 

25 are not expected to be advantageous can also be useful if these result in the production of 
functional sequences. 

Variants that are fragments of the polypeptides of the invention can be used to 
produce the corresponding full length polypeptide by peptide synthesis. Therefore, these 
variants can be used as intermediates for producing the full-length polypeptides of the 

30 invention. 

The polynucleotides and polypeptides of the invention can be used, for example, in 
the transformation of host cells, such as plant cells, animal cells, yeast cells, bacteria, 
bacteriophage, and viruses, as further discussed herein. 
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The invention also provides polynucleotides that encode a polypeptide that is a mature 
protein plus additional amino or carboxyl-tenninal amino acids, or amino acids within the 
mature polypeptide (for example, when the mature form of the protein has more than one 
polypeptide chain). Such sequences can, for example, play a role in the processing of a 
protein from a precursor to a mature form, aHow protein transport, shorten or lengthen protein 
half-life, or facilitate manipulation of the protein in assays or production. It is contemplated 
that cellular enzymes can be used to remove any additional amino acids from the mature 
protein. 

A precursor protein, having the mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. The inactive precursors generally 
are activated when the prosequences are removed. Some or all of the prosequences may be 
removed prior to activation. Such precursor protein are generally called proproteins. 

Preparation of Expression Constructs and Methods of Use 

Of interest is the use of the nucleotide sequences in recombinant DNA constructs to 
direct the transcription or transcription and translation (expression) of the acyltransferase 
sequences of the present invention in a host cell. Of particular interest is the use of the 
polynucleotide sequences of the present invention in recombinant DNA constructs to direct 
the transcription or transcription and translation (expression) of the acyltransferase sequences 
of the present invention in a host plant cell. 

The expression constructs generally comprise a regulatory sequence functional in a 
host cell operably linked to a nucleic acid sequence encoding a lecithimcholesterol 
acyltransferase-like polypeptide or acyl Co A: cholesterol acyltransferase-like polypeptide of 
the present invention and a transcriptional termination region functional in a host plant cell. 
Of particular interest is the use of promoters (also referred to as transcriptional initiation 
regions) functional in plant host cells. 

Those skilled in the art will recognize that there are a number of promoters which are 
functional in plant cells, and have been described in the literature including constitutive, 
inducible, tissue specific, organelle specific, developmentally regulated and environmentally 
regulated promoters. Chloroplast and plastid specific promoters, chloroplast or plastid 
functional promoters, and chloroplast or plastid operable promoters are also envisioned. 
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One set of promoters are constitutive promoters such as the CaMV35S or FMV35S 
promoters that yield high levels of expression in most plant organs. Enhanced or duplicated 
versions of the CaMV35S and FMV35S promoters are useful in the practice of this invention 
(Odell, et al. (1985) Nature 313:810-8 12; Rogers, U.S. Patent Number 5,378, 619). Other 
useful constitutive promoters include, but are not limited to, the mannopine synthase (mas) 
promoter, the nopaline synthase (nos) promoter, and the octopine synthase {pes) promoter. 

Useful inducible promoters include heat-shock promoters (Ou-Lee et al. (1986) Proc. 
Natl Acad. Sci. USA 83: 6815; Ainley et al. (1990) Plant Mol. Biol. 14: 949), a 
nitrate-inducible promoter derived from the spinach nitrite reductase gene (Back et al. (1991) 
Plant Mol. Biol. 17: 9), hormone-inducible promoters (Yamaguchi-Shinozaki et al. (1990) 
Plant Mol. Biol. 15: 905; Kares et al. (1990) Plant Mol. Biol. 15: 905), and light-inducible 
promoters associated with the small subunit of RuBP carboxylase and LHCP gene families 
(Kuhlemeier et al. (1989) Plant Cell 1: 471; Feinbaum et al. (1991) Mol. Gen. Genet. 226: 
449; Weisshaar et al. (1991) EMBOJ. 10: 1777; Lam and Chua (1990) Science 248: 471; 
Castresana et al. (1988) EMBOJ. 7: 1929; Schulze-Lefert et al. (1989) EMBO J. 8: 651). 

In addition, it may also be preferred to bring about expression of the acyltransferase 
gene in specific tissues of the plant, such as leaf, stem, root, tuber, seed, fruit, etc., and the 
promoter chosen should have the desired tissue and developmental specificity. Examples of 
useful tissue-specific, developmentally-regulated promoters include fruit-specific promoters 
such as the E4 promoter (Cordes et al. (1989) Plant Cell 1:1025), the E8 promoter (Deikman 
et al. (1988) EMBO J. 7: 3315), the kiwifruit actinidin promoter (Lin et al. (1993) PNAS 90: 
5939), the 2A1 1 promoter (Houck et al., U.S. Patent 4,943,674), and the tomato pZ130 
promoter (U.S. Patents 5,175, 095 and 5,530,185); the p-conglycinin 7S promoter (Doyle et 
al. (1986) J. Biol. Chem. 261: 9228; Slighton and Beachy (1987) Planta 172: 356), and 
seed-specific promoters (Knutzon et al. (1992) Proc. Natl. Acad. Sci. USA 89: 2624; Bustos 
et al. (1991) EMBO J. 10: 1469; Lam and Chua (1991) J. Biol. Chem. 266: 17131; Stayton et 
al. (1991) Aust. J. Plant. Physiol. 18: 507). Fruit-specific gene regulation is discussed in U.S. 
Patent 5,753,475. Other useful seed-specific promoters include, but are not limited to, the 
napin, phaseolin, zein, soybean trypsin inhibitor, 7S, ADR12, ACP, stearoyl-ACP desaturase, 
oleosin, Lasquerella hydroxylase, and barley aldose reductase promoters (Bartels (1995) 
Plant J. 7: 809-822), the EA9 promoter (U.S. Patent 5,420,034), and the Bce4 promoter (U.S. 
Patent 5,530,194). Useful embryo-specific promoters include the corn globulin 1 and oleosin 
promoters. Useful endosperm-specific promoters include the rice glutelin-1 promoter, the 
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promoters for the low-pi p amylase gene (Amy32b) (Rogers et al. (1984) J. Biol. Chem. 259: 
12234), the high-pi p amylase gene (Amy 64) (Khurseed et al. (1988) J. Biol. Chem. 263: 
18953), and the promoter for a barley thiol protease gene ("Aleurain") (Whittier et al. (1987) 
Nucleic Acids Res. 15:2515). 
5 Of particular interest is the expression of the nucleic acid sequences of the present 

invention from transcription initiation regions which are preferentially expressed in a plant 
seed tissue. Examples of such seed preferential transcription initiation sequences include 
those sequences derived from sequences encoding plant storage protein genes or from genes 
involved in fatty acid biosynthesis in oilseeds. Examples of such promoters include the 5' 
10 regulatory regions from such genes as napin (Kridl et al, Seed Sci. Res. 7:209:219 (1991)), 
phaseolin, zein, soybean trypsin inhibitor, ACP, stearoyl-ACP desaturase, soybean a' subunit 
of P-conglycinin (soy 7s, (Chen etal, Proc. Natl. Acad. Sci., 83:8560-8564 (1986))) and 
oleosin. Seed-specific gene regulation is discussed in EP 0 255 378 Bl and U.S. Patents 
5,420,034 and 5,608,152 . Promoter hybrids can also be constructed to enhance 
1 5 transcriptional activity (Hoffman, U.S. Patent No. 5, 106,739), or to combine desired 
transcriptional activity and tissue specificity. 

It may be advantageous to direct the localization of proteins conferring LCAT to a 
- particular subcellular compartment, for example, to the mitochondrion, endoplasmic 

reticulum, vacuoles, chloroplast or other plastidic compartment. For example, where the 
Z 20 genes of interest of the present invention will be targeted to plastids, such as chloroplasts, for 
expression, the constructs will also employ the use of sequences to direct the gene to the 
plastid. Such sequences are referred to herein as chloroplast transit peptides (CTP) or plastid 
transit peptides (PTP). In this manner, where the gene of interest is not directly inserted into 
the plastid, the expression construct will additionally contain a gene encoding a transit 
25 peptide to direct the gene of interest to the plastid. The chloroplast transit peptides may be 
derived from the gene of interest, or may be derived from a heterologous sequence having a 
CTP. Such transit peptides are known in the art. See, for example, Von Heijne et al. (1991) 
Plant Mol Biol. Rep. 9: 104-126; Clark etal. (1989) J. Biol Chem. 2(54:17544-17550; della- 
Cioppa et al. (1987) Plant Physiol. 54:965-968; Romer et al. (1993) Biochem. Biophys. Res 
30 Commun. 795:1414-1421; and, Shah et al. (1986) Science 253:478-481. 

Depending upon the intended use, the constructs may contain the nucleic acid 
sequence which encodes the entire LCAT protein, a portion of the LCAT protein, the entire 
AC AT protein, or a portion of the ACAT protein. For example, where antisense inhibition of 
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a given LCAT or ACAT protein is desired, the entire sequence is not required. Furthermore, 
where LCAT or ACAT sequences used in constructs are intended for use as probes, it maybe 
advantageous to prepare constructs containing only a particular portion of a LCAT or ACAT 
encoding sequence, for example a sequence which is discovered to encode a highly conserved 

5 region. *~ 

The skilled artisan will recognize that there are various methods for the inhibition of 
expression of endogenous sequences in a host cell. Such methods include, but are not limited 
to antisense suppression (Smith, et al. (1988) Nature 334:724-726) , co-suppression (Napoli, 
et al (1989) Plant Cell 2:279-289), ribozymes (PCT Publication WO 97/10328), and 

10 combinations of sense and antisense Waterhouse, et al. (1998) Proc. Natl. Acad. Sci. USA 
95 : 13959- 1 3964. Methods for the suppression of endogenous sequences in a host cell 
typically employ the transcription or transcription and translation of at least a portion of the 
sequence to be suppressed. Such sequences may be homologous to coding as well as non- 
coding regions of the endogenous sequence. 

1 5 Regulatory transcript termination regions may be provided in plant expression 

constructs of this invention as well. Transcript termination regions may be provided by the 
DNA sequence encoding the diacylglycerol acyltransferase or a convenient transcription 
termination region derived from a different gene source, for example, the transcript 
termination region which is naturally associated with the transcript initiation region. The 

20 skilled artisan will recognize that any convenient transcript termination region which is 

capable of terminating transcription in a plant cell may be employed in the constructs of the 
present invention. 

Alternatively, constructs may be prepared to direct the expression of the LCAT or 

ACAT sequences directly from the host plant cell plastid. Such constructs and methods are 
25 known in the art and are generally described, for example, in Svab, et al. (1990) Proc. Natl. 

Acad. Sci. USA 87:8526-8530 and Svab and Maliga (1993) Proc. Natl. Acad. Sci. USA 

90:913-917 and in U.S. Patent Number 5,693,507. 

A plant cell, tissue, organ, or plant into which the recombinant DNA constructs 

containing the expression constructs have been introduced is considered transformed, 
30 transfected, or transgenic. A transgenic or transformed cell or plant also includes progeny of 

the cell or plant and progeny produced from a breeding program employing such a transgenic 

plant as a parent in a cross and exhibiting an altered phenotype resulting from the presence of 

a LCAT nucleic acid sequence. 
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Plant expression or transcription constructs having a plant LCAT as the DNA 
sequence of interest for increased or decreased expression thereof may be employed with a 
wide variety of plant life, particularly, plant life involved in the production of vegetable oils 
for edible and industrial uses. Most especially preferred are temperate oilseed crops. Plants 

5 of interest include, but are not limited to, rapeseed (Canola and High Erucic Acid varieties), 
sunflower, safflower, cotton, soybean, peanut, coconut and oil palms, and corn. Depending 
on the method for introducing the recombinant constructs into the host cell, other DNA 
sequences may be required. Importantly, this invention is applicable to dicotyledyons and 
monocotyledons species alike and will be readily applicable to new and/or improved 

1 0 transformation and regulation techniques. 

Of particular interest, is the use of plant LCAT and AC AT constructs in plants to 
produce plants or plant parts, including, but not limited to leaves, stems, roots, reproductive, 
and seed, with a modified content of lipid and/or sterol esters and to alter the oil production 
by such plants. 

1 5 Of particular interest in the present invention, is the use of AC AT genes in 

conjunction with the LCAT sequences to increase the sterol content of seeds. Thus, 
overexpression of a nucleic acid sequence encoding an ACAT and LCAT in an oilseed crop 
may find use in the present invention to increase sterol levels in plant tissues and/or increase 
oil production. 

20 It is contemplated that the gene sequences may be synthesized, either completely or in 

part, especially where it is desirable to provide plant-preferred sequences. Thus, all or a 
portion of the desired structural gene (that portion of the gene which encodes the LCAT or 
ACAT protein) may be synthesized using codons preferred by a selected host. Host-preferred 
codons may be determined, for example, from the codons used most frequently in the proteins 

25 expressed in a desired host species. 

One skilled in the art will readily recognize that antibody preparations, nucleic acid 
probes (DNA and RNA) and the like may be prepared and used to screen and recover 
"homologous" or "related" sequences from a variety of plant sources. Homologous sequences 
are found when there is an identity of sequence, which may be determined upon comparison 

30 of sequence information, nucleic acid or amino acid, or through hybridization reactions 
between a known LCAT and a candidate source. Conservative changes, such as Glu/Asp, 
Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered in deteimining sequence 
homology. Amino acid sequences are considered homologous by as little as 25% sequence 
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identity between the two complete mature proteins. (See generally, Doolittle, R.F., OF URFS 
and ORFS (University Science Books, CA, 1986.) 

Thus, other LC ATs may be obtained from the specific sequences provided herein. 
Furthermore, it will be apparent that one can obtain natural and synthetic sequences, 
5 including modified amino acid sequences and starting materials for synthetic-protein 

modeling from the exemplified LCAT and ACAT sequences and from sequences which are 
obtained through the use of such exemplified sequences. Modified amino acid sequences 
include sequences which have been mutated, truncated, increased and the like, whether such 
sequences were partially or wholly synthesized. Sequences which are actually purified from 
10 plant preparations or are identical or encode identical proteins thereto, regardless of the 
method used to obtain the protein or sequence, are equally considered naturally derived. 

For immunological screening, antibodies to the protein can be prepared by injecting 
rabbits or mice with the purified protein or portion thereof, such methods of preparing 
antibodies being well known to those in the art. Either monoclonal or polyclonal antibodies 
1 5 can be produced, although typically polyclonal antibodies are more useful for gene isolation. 
Western analysis may be conducted to determine that a related protein is present in a crude 
extract of the desired plant species, as determined by cross-reaction with the antibodies to the 
encoded proteins. When cross-reactivity is observed, genes encoding the related proteins are 
isolated by screening expression libraries representing the desired plant species. Expression 
20 libraries can be constructed in a variety of commercially available vectors, including lambda 
gtl 1, as described in Sambrook, et al. (Molecular Cloning: A Laboratory Manual, Second 
Edition (1989) Cold Spring Harbor Laboratory, Cold Spring Harbor, New York). 

To confirm the activity and specificity of the proteins encoded by the identified 
nucleic acid sequences as acyltransferase enzymes, in vitro assays are performed in insect cell 
25 cultures using baculovirus expression systems. Such baculovirus expression systems are 

known in the art and are described by Lee, et al. U.S. Patent Number 5,348,886, the entirety 
of which is herein incorporated by reference. 

In addition, other expression constructs may be prepared to assay for protein activity 
utilizing different expression systems. Such expression constructs are transformed into yeast 
30 or prokaryotic host and assayed for acyltransferase activity. Such expression systems are 
known in the art and are readily available through commercial sources. 

The method of transformation in obtaining such transgenic plants is not critical to the 
instant invention, and various methods of plant transformation are currently available. 
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Furthermore, as newer methods become available to transform crops, they may also be 
directly applied hereunder. For example, many plant species naturally susceptible to 
Agrobacterium infection may be successfully transformed via tripartite or binary vector 
methods of Agrobacterium mediated transformation. In many instances, it will be desirable 
5 to have the construct bordered on one or both sides by T-DNA, particularly having the left 
and right borders, more particularly the right border. This is particularly useful when the 
construct uses A. tumefaciens ox A. rhizogenes as a mode for transformation, although the T- 
DNA borders may find use with other modes of transformation. In addition, techniques of 
microinjection, DNA particle bombardment, and electroporation have been developed which 
10 allow for the transformation of various monocot and dicot plant species. 

Normally, included with the DNA construct will be a structural gene having the 
necessary regulatory regions for expression in a host and providing for selection of 
transformant cells. The gene may provide for resistance to a cytotoxic agent, e.g. antibiotic, 
heavy metal, toxin, etc., complementation providing prototrophy to an auxotrophic host, viral 
1 5 immunity or the like. Depending upon the number of different host species the expression 
construct or components thereof are introduced, one or more markers may be employed, 
where different conditions for selection are used for the different hosts. 

Non-limiting examples of suitable selection markers include genes that confer 
resistance to bleomycin, gentamycin, glyphosate, hygromycin, kanamycin, methotrexate, 
20 phleomycin, phosphinotricin, spectinomycin, streptomycin, sulfonamide and sulfonylureas. 
Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor Laboratory Press, 
1995, p. 39. Examples of markers include, but are not limited to, alkaline phosphatase (AP), 
myc, hemagglutinin (HA), p glucuronidase (GUS), luciferase, and green fluorescent protein 
(GFP). 

25 Where Agrobacterium is used for plant cell transformation, a vector may be used 

which may be introduced into the Agrobacterium host for homologous recombination with T- 
DNA or the Ti- or Ri-plasmid present in the Agrobacterium host. The Ti- or Ri-plasmid 
containing the T-DNA for recombination may be armed (capable of causing gall formation) 
or disarmed (incapable of causing gall formation), the latter being permissible, so long as the 

30 vir genes are present in the transformed Agrobacterium host. The armed plasmid can give a 
mixture of normal plant cells and gall. 

In some instances where Agrobacterium is used as the vehicle for transforming host 
plant cells, the expression or transcription construct bordered by the T-DNA border region(s) 
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will be inserted into a broad host range vector capable of replication in E. coli and 
Agrobacterium, there being broad host range vectors described in the literature. Commonly 
used is pRK2 or derivatives thereof. See, for example, Ditta, et al., (Proc. Nat. Acad. Set, 
U.S.A. (1980) 77:7347-7351) and EPA 0 120 515, which are incorporated herein by 
reference. Alternatively, one may insert the/sequences to be expressed in plant cells into a 
vector containing separate replication sequences, one of which stabilizes the vector in E. coli, 
and the other in Agrobacterium. See, for example, McBride and Summerfelt (Plant Mol. 
Biol. (1990) 14:269-216), wherein the pRiHRI (Jouanin, et al, Mol. Gen. Genet. (1985) 
201:370-374) origin of replication is utilized and provides for added stability of the plant 
expression vectors in host Agrobacterium cells. 

Included with the expression construct and the T-DNA can be one or more markers, 
which allow for selection of transformed Agrobacterium and transformed plant cells. A 
number of markers have been developed for use with plant cells, such as resistance to 
chloramphenicol, kanamycin, the aminoglycoside G418, hygromycin, or the like. The 
particular marker employed is not essential to this invention, one or another marker being 
preferred depending on the particular host and the manner of construction. 

For transformation of plant cells using Agrobacterium, explants may be combined and 
incubated with the transformed Agrobacterium for sufficient time for transformation, the 
bacteria killed, and the plant cells cultured in an appropriate selective medium. Once callus 
forms, shoot formation can be encouraged by employing the appropriate plant hormones in 
accordance with known methods and the shoots transferred to rooting medium for 
regeneration of plants. The plants may then be grown to seed and the seed used to establish 
repetitive generations and for isolation of vegetable oils. 

Thus, in another aspect of the present invention, methods for modifying the sterol 
and/or stanol composition of a host cell. Of particular interest are methods for modifying the 
sterol and/or stanol composition of a host plant cell, hi general the methods involve either 
increasing the levels of sterol ester compounds as a proportion of the total sterol compounds. 
The method generally comprises the use of expression constructs to direct the expression of 
the polynucleotides of the present invention in a host cell. 

Also provided are methods for reducing the proportion of sterol ester compounds as a 
percentage of total sterol compounds in a host plant cell. The method generally comprises the 
use of expression constructs to direct the suppression of endogenous acyltransferase proteins 
in a host cell. 
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Of particular interest is the use of expression constructs to modify the levels of sterol 
compounds in a host plant cell. Most particular, the methods find use in modifying the levels 
of sterol compounds in seed oils obtained from plant seeds. 

Also of interest is the use of expression constructs of the present invention to alter oil 
production in a host cell and in particular to increase oil production. Of particular interest is 
the use of expression constructs containing nucleic acid sequences encoding LCAT and/or 
ACAT polypeptides to transform host plant cells and to use these host cells to regenerate 
whole plants having increase oil production as compared to the same plant not containing the 
expression construct. 

The oils obtained from transgenic plants having modified sterol compound content 
find use in a wide variety of applications. Of particular interest in the present invention is the 
use of the oils containing modified levels of sterol compounds in applications involved in 
improving human nutrition and cardiovascular health. For example, phytostanols are 
beneficial for lowering serum cholesterol (Ling, et al. (1995) Life Sciences 57:195-206). 

Cholesterol-lowering compositions comprise the oils and sterol ester compound 
compositions obtained using the methods of the present invention. Such cholesterol lowering 
compositions include, but are not limited to foods, food products, processed foods, food 
ingredients, food additive compositions, or dietary/nutritional supplements that contain oils 
and/or fats. Non-limiting examples include margarines; butters; shortenings; cooking oils; 
frying oils; dressings, such as salad dressings; spreads; mayonnaises; and vitamin/mineral 
supplements. Patent documents relating to such compositions include, U.S. Patents 
4,588,717 and 5,244,887, and PCT International Publication Nos. WO 96/38047, WO 
97/42830, WO 98/06405, and WO 98/06714. Additional non-limiting examples include 
toppings; dairy products such as cheese and processed cheese; processed meat; pastas; 
sauces; cereals; desserts, including frozen and shelf-stable desserts; dips; chips; baked 
goods; pastries; cookies; snackbars; confections; chocolates; beverages; unextracted seed; 
and unextracted seed that has been ground, cracked, milled, rolled, extruded, pelleted, 
defatted, dehydrated, or otherwise processed, but which still contains the oils, etc., 
disclosed herein. 

The cholesterol-lowering compositions can also take the form of pharmaceutical 
compositions comprising a cholesterol-lowering effective amount of the oils or sterol 
compound compositions obtained using the methods of the present invention, along with a 
pharmaceuticaHy acceptable carrier, excipient, or diluent. These pharmaceutical 
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compositions can be in the form of a liquid or a solid. Liquids can be solutions or 
suspensions; solids can be in the form of a powder, a granule, a pill, a tablet, a gel, or an 
extradate. U.S. Patent 5,270,041 relates to sterol-containing pharmaceutical compositions. 

Thus, by expression of the nucleic acid sequences encoding acyltransferase-like 
sequences of the present invention in a host cell, it is possible to modify the lipid content 
and/or composition as well as the sterol content and/or composition of the host cell. 

The invention now being generally described, it will be more readily understood by 
reference to the following examples which are included for purposes of illustration only and 
are not intended to limit the present invention. 

EXAMPLES 

Example 1: RNA Isolations 

Total RNA from the inflorescence and developing seeds of Arabidopsis thaliana was 
isolated for use in construction of complementary (cDNA) libraries. The procedure was an 
adaptation of the DNA isolation protocol of Webb and Rnapp (DM. Webb and SJ. Knapp, 
(1990) Plant Molec. Reporter, 8, 180-185). The following description assumes the use of lg 
fresh weight of tissue. Frozen seed tissue was powdered by grinding under liquid nitrogen. 
The powder was added to 10ml REC buffer (50mM Tris-HCl, pH 9, 0.8M NaCl, lOmM 
EDTA, 0.5% w/v CTAB (cetyltrimethyl-ammonium bromide)) along with 0.2g insoluble 
polyvinylpolypyrrolidone, and ground at room temperature. The homogenate was 
centrifuged for 5 minutes at 12,000 xg to pellet insoluble material. The resulting supernatant 
fraction was extracted with chloroform, and the top phase was recovered. 

The RNA was then precipitated by addition of 1 volume RecP (50mM Tris-HCL pH9, 
lOmM EDTA and 0.5% (w/v) CTAB) and collected by brief centrifugation as before. The 
RNA pellet was redissolved in 0.4 ml of 1M NaCl. The RNA pellet was redissolved in water 
and extracted with phenol/chloroform. Sufficient 3M potassium acetate (pH 5) ws added to 
make the mixture 0.3M in acetate, followed by addition of two volumes of ethanol to 
precipitate the RNA. After washing with ethanol, this final RNA precipitate was dissolved in 
water and stored frozen. 



31 MTC6718 
PATENT 

Alternatively, total RNA may be obtained using TRIzol reagent (BRL- 
Lifetechnologies, Gaithersburg, MD) following the manufacturer's protocol. The RNA 
precipitate was dissolved in water and stored frozen. 

Example 2: Identification of LCAT Sequences 

Searches were performed on a Silicon Graphics Unix computer using additional 
Bioaccellerator hardware and GenWeb software supplied by Compugen Ltd. This software 
and hardware enabled the use of the Smith- Waterman algorithm in searching DNA and 
protein databases using profiles as queries. The program used to query protein databases was 
profilesearch. This is a search where the query is not a single sequence but a profile based on 
a multiple alignment of amino acid or nucleic acid sequences. The profile was used to query a 
sequence data set, i.e., a sequence database. The profile contained all the pertinent 
information for scoring each position in a sequence, in effect replacing the "scoring matrix" 
used for the standard query searches. The program used to query nucleotide databases with a 
protein profile was tprofilesearch. Tprofilesearch searches nucleic acid databases using an 
amino acid profile query. As the search is running, sequences in the database are translated to 
amino acid sequences in six reading frames. The output file for tprofilesearch is identical to 
the output file for profilesearch except for an additional column that indicates the frame in 
which the best alignment occurred. 

The Smith-Waterman algorithm, (Smith and Waterman (1981) J. Molec. Biol, 
147:195-197), was used to search for similarities between one sequence from the query and a 
group of sequences contained in the database. 

A protein sequence of Lecithin: cholesterol acyltransferase from human (McLean J, et 
al. (1986) Nucleic Acids Res. 14(23):9397-406 SEQ ID NO:l)) was used to search the NCBI 
non-redundant protein database using BLAST. Three sequences were identified from 
Arabidopsis, GenBank accessions AC004557 ( referred to herein as LCAT1, SEQ ID NO:2), 
AC003027 (referred to herein as LCAT2, SEQ ID NO:4), and AL024486 (referred to herein 
as LCAT3, SEQ ID NO:6). The deduced amino acid sequences are provided in SEQ ID NOs: 
3, 5, and 7, respectively. 

The profile generated from the queries using PSI-BLAST was excised from the hyper 
text markup language (html) file. The worldwide web (www)/html interface to psiblast at 
ncbi stores the current generated profile matrix in a hidden field in the html file that is 
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returned after each iteration of psiblast. However, this matrix has been encoded into string62 
(s62) format for ease of transport through html. String62 format is a simple conversion of the 
values of the matrix into html legal ascii characters. 

The encoded matrix width (x axis) is 26 characters, and comprise the consensus 
5 characters, the probabilities of each ainino aeid in the order A,B>C,D,E,F,G,H,I,K,L,M,N, 
P,Q,R,S,T,V,W,X,Y,Z (where B represents D and N, and Z represents Q and E, and X 
represents any arriino acid), gap creation value, and gap extension value. 

The length (y axis) of the matrix corresponds to the length of the sequences identified 
by PSI-BLAST. The order of the amino acids corresponds to the conserved amino acid 
10 sequence of the sequences identified using PSI-BLAST, with the N-terminal end at the top of 
the matrix. The probabilities of other amino acids at that position are represented for each 
amino acid along the x axis, below the respective single letter amino acid abbreviation. 

Thus, each row of the profile consists of the highest scoring (consensus) amino acid, 
followed by the scores for each possible amino acid at that position in sequence matrix, the 
1 5 score for opening a gap that that position, and the score for continuing a gap at that position. 

The string62 file is converted back into a profile for use in subsequent searches. The 
gap open field is set to 1 1 and the gap extension field is set to 1 along the x axis. The gap 
creation and gap extension values are known, based on the settings given to the PSI-BLAST 
algorithm. The matrix is exported to the standard GCG profile form. This format can be read 
20 by GenWeb. 

The algorithm used to convert the string62 formatted file to the matrix is outlined in 
Table 1. 
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Table 1 

1. if encoded character z then the value is blast score min 

2. if encoded character Z then the value is blast score max 

3. else if the encoded character is uppercase^then its value is (64-(ascii # of char)) 

4. else if the encoded character is a digit the value is ((ascii # of char)-48) 

5. else if the encoded character is not uppercase then the value is ((ascii # of char) - 87) 

6. ALL B positions are set to min of D and N amino acids at that row in sequence matrix 

7. ALL Z positions are set to min of Q amd E amino acids at that row in sequence matrix 

8. ALL X positions are set to min of all amino acids at that row in sequence matrix 

9. kBLAST_SCORE_MAX=999; 

10. kBLAST_SCORE_MTN=-999; 

1 1 . all gap opens are set to 1 1 

12. all gap lens are set to 1 



The protein sequences of LCAT1 , LCAT2, and LCAT3 as well as the PSI-BLAST 
profile were used to search public and proprietary databases for additional LC AT sequences. 
Two EST sequences were identified which appear to be identical to LCAT1 and LCAT3, 
respectively. One additional Arabidopsis sequence was identified from the proprietary 
databases, LCAT4 (SEQ ID NO:8). The deduced protein sequence of LCAT4 is provided in 
SEQ ID NO:9. Two additional genomic sequences were identified using the PSI-BLAST 
profile from libraries of Arabidopsis ecotypes Columbia and Landsberg, LCAT7 (SEQ ID 
NO: 10) and LCAT8 (SEQ ID NO:l 1). The LCAT7 sequence was present in both the 
Columbia and Landsberg genomic libraries, while the LCAT8 sequence was only present in 
the Columbia library. 

An open reading frame was predicted from the genomic sequence of LCAT7 in the 
Arabidopsis public database and this sequence was called MSH12 (referred to herein as 
LCAT5, SEQ ID NO: 73). The deduced protein sequence of LCAT5 is provided in SEQ ID 
NO: 74. 

The PSI-BLAST profile and the LCAT sequences were used to query the public yeast 
database and proprietary libraries containing corn and soy EST sequences. The yeast genome 
contains only one gene, LROl (LCAT Related Open reading frame, YNR008W, Figure 1) 
with distinct similarity to the human LCAT. The DNA sequence of LROl is provided in SEQ 
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ED NO: 75 and the protein sequence is provided in SEQ ID NO: 76. Seven EST sequences 
were identified from soybean libraries as being LCAT sequences. Two sequences from soy 
(SEQ ID NOs: 12 and 13) are most closely related to the Arabidopsis LCAT1 sequence, a 
single sequence was identified as being most closely related to LCAT2 (SEQ ID NO: 14) , 
5 three were closely related to LCAT3 (SEQ ID NOs: 15-17), and an additional single sequence 
was identified (SEQ ID NO: 18). A total of 1 1 corn EST sequences were identified as being 
related to the Arabidopsis LCAT sequences. Two corn EST sequences (SEQ ID NOs: 19 and 
20) were most closely related to LCAT1, two sequences were identified as closely related to 
LCAT2 (SEQ ID NOs: 21 and 22), four corn EST sequences were identified as closely 
10 related to LCAT3 (SEQ ID NOs: 23-26), and an additional three corn EST sequences were 
also identified (SEQ ID NOs: 27-29). 

Example 3: Identification of ACAT Sequences 

1 5 Since plant ACATs are unknown in the art, searches were performed to identify 

known and related ACAT sequences from mammalian sources from public databases. These 
sequences were then used to search public and proprietary EST databases to identify plant 
ACAT-like sequences. 

A public database containing mouse Expressed Sequence Tag (EST) sequences 

20 (dBEST) was searched for ACAT-like sequences. The search identified two sequences (SEQ 
ID 30 and 31) which were related (approximately 20% identical), but divergent, to known 
ACAT sequences. 

In order to identify ACAT-like sequences from other organisms, the two mouse 
ACAT sequences were used to search public and proprietary databases containing EST 

25 sequences from human and rat tissues. Results of the search identified several sequences 
from the human database and from the rat database which were closely related to the mouse 
sequences. The human and rat ACAT-like EST sequences were assembled, using the GCG 
assembly program, to construct a complete inferred cDNA sequence by identifying 
overlapping sequences (SEQ ID NOs: 32 and 33, respectively). 

30 The protein sequence of the human ACAT-like sequence was aligned with known 

ACAT sequences from human (Chang, et al. (1993) J. Biol. Chem. 268:20747-20755, SEQ 
ID NO:34), mouse (Uelmen, et al. (1995) J. Biol. Chem. 270:26192-26201 SEQ tD NO:35) 
and yeast (Yu, et al. (1996) J. Biol. Chem. 271:24157-24163, SEQ ID NO:36 and Yang, et al. 
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(1996) Science 272:1353-1356, SEQ ID NO:37) using MacVector (Oxford Molecular, Inc.). 
Results of the alignment demonstrated that the sequence was related to the known sequences, 
however the related sequence was only about 25% similar to the known sequences. 
The protein sequence of the human sterol O-acyltransferase (AC AT, Acyl 
5 CoA:Cholesterol acyltransferase, Accession/number A48026) related sequence was used to 
search protein and nucleic acid Genbank databases. A single plant homologue was identified 
in the public Arabidopsis EST database (Accession A042298, SEQ ID NO:38). The protein 
sequence (SEQ ID NO:39)was translated from the EST sequence, and was found to contain a 
peptide sequence conserved in both mammalian and yeast ACATs (Chang et al, (1997) Ann. 
10 Rev. Biochem., 66:613-638). 

To obtain the entire coding region corresponding to the Arabidopsis ACAT-like EST, 
synthetic oligo-nucleotide primers were designed to amplify the 5' and 3' ends of partial 
cDNA clones containing ACAT-like sequences. Primers were designed according to the 
Arabidopsis ACAT-like EST sequence and were used in Rapid Amplification of cDNA Ends 
15 (RACE) reactions (Frohman et al. (1988) Proc. Natl Acad. Set USA 85:8998-9002). 

Primers were designed (5 '-TGCAAATTGACGAGCACACCAACCCCTTC-3 ' (SEQ 
ID NO:40) and 5 '-AAGGATGCTTTGAGTTCCTGACAATAGG-3 ' (SEQ ID NO:41)) to 
amplify the 5' end from the Arabidopsis AC AT EST sequence. Amplification of flanking 
sequences from cDNA clones were performed using the Marathon cDNA Amplification kit 
20 (Clontech, CA). 

The sequence derived from the 5 '-RACE amplification was used to search proprietary 
Arabidopsis EST libraries. A single EST accession, LIB25-088-C7 (SEQ ID NO:42), was 
identified which contained a sequence identical to the 5 '-RACE sequence. Furthermore, 
LIB25-088-C7 was found to contain the complete putative coding sequence for the 
25 Arabidopsis ACAT-like product. 

The nucleic acid as well as the putative translation product sequences of A042298 
were used to search public and proprietary databases. Four EST sequences were identified in 
both soybean (SEQ ID NOs:43-46) and maize (SEQ ID NOs:47-50) proprietary databases, 
and a single ACAT-like sequence was identified from Mortierrella alpina EST sequences 
30 (SEQIDNO:51). 

Sequence alignments between ACAT sequences from several different sources were 
compared to identify the similarity between the sequences. Nucleotide sequences from 
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known human and mouse ACATs, as well as nucleotide sequences from known yeast ACATs 
were compared to the ACAT-like EST sequences from human and Arabidopsis. 

Analysis of the sequence alignments revealed several classes of ACATs based on 
sequence similarity. The known human and mouse ACATs, being 88% similar in the 
nucleotide sequence, formed one class of AGATs. Another class of ACATs included the 
yeast ACATs which are less than 20% similar to the known human and mouse class ACATs. 

The final class of ACATs included the Arabidopsis and human sequences disclosed in 
the present invention. This class is approximately 22% similar to the known human and 
mouse ACAT class and approximately 23% similar to the yeast class of ACATs. Thus, the 
ACAT sequences disclosed in the present invention represent a novel class of ACAT 
enzymes. Partial mouse sequences of this class are also provided. 

Example 4: Expression Construct Preparation 

Constructs were prepared to direct expression of the LCAT1, LCAT2, LCAT3, 
LCAT4, LCAT5 and the yeast LROl sequences in plants and cultured insect cells. The entire 
coding region of each LCAT was amplified from the appropriate EST clone or an 
Arabidopsis genomic cDNA library using the following oligonucleotide primers in a 
polymerase chain reactions (PCR). The LCAT1 coding sequence was amplified from the 
EST clone Lib25-082-Ql-El-G4 using the primers 

5 ' -GGATCCGCGGCCGC AC AATGAAAAAAATATCTTC ACATT ATTCGG-3 ' (SEQ ID 
NO:52) and 5 ' -GGATCCCCTGC AGGTC ATTC ATTGACGGC ATTAACATTGG-3 ' (SEQ 
ID NO:53). The LCAT2 coding sequence was amplified from an Arabidopsis genomic 
cDNA library using the synthetic oligo nucleotide primers 

5 ' -GGATCCGCGGCCGC ACAATGGGAGCGAATTCGAAATCAGTAACG-3 ' (SEQ ID 
NO:54) and 5 '-GGATCCCCTGCAGGTTAATACCCACTTTT ATCAAGCTCCC-3 ' (SEQ 
ID NO:55). The LCAT3 coding sequence was amplified from the EST clone LIB22-004-Q1- 
E1-B4 using the synthetic oligo nucleotide primers 

5'-GGATCCGCGGCCGCACAATGTCTCTATTACTGGAA GAGATC-3' (SEQ ID NO:56) 
and 5 ' -GGATCCCCTGC AGGTTATGCATC AAC AGAGAC ACTTAC AGC-3 ' (SEQ ID 
NO:57) . The LCAT4 coding sequence was amplified from the EST clone LIB23-007-Q1- 
E1-B5 using the synthetic oligo nucleotide primers 
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5'-GGATCCGCGGCCGCACAATGGGCTGGATTCCGTGTCCGTGC-3' (SEQ ID NO:58) 
and 5 '-GGATCCCCTGCAGGTTAACCAGAATCAACTACTTTGTG-3 ' (SEQ ID NO:59). 
The LCAT5 coding sequence was amplified from LIB23-053-Q1-E1-E3 using the synthetic 
oligo nucleotide primers 
5 5 '-GGATCCGCGGCCGCACAATGCCCQTTATTCATCGG-3 ' (SEQ ID NO:77) and 5'- 
GGATCCCCTGCAGGTCACAGCTTCAGGTCAATACG-3' (SEQ ID NO:78). 

The yeast LROI coding sequence was amplified from genomic yeast DNA using the 
synthetic oligo nucleotide primers 

5 ' GGATCCGCGGCCGC ACAATGGGC ACACTGTTTCGAAG3 ' (SEQ ID NO:79) 
10 and 5 'GGATCCCCTGCAGGTTAC ATTGGGAAGGGCATCTGAG3 ' (SEQ ID NO:80). 

The entire coding region of the Arabidopsis ACAT sequence (SEQ ID NO: 42) was 
amplified from the EST clone LIB25-088-C7 using oligonucleotide primers 
5'-TCGACCTGCAGGAAGCTTAGAAATGGCGATTTTGGATTC-3' (SEQ ID NO: 60) 
and 5 ' -GGATCCGCGGCCGCTC ATGAC ATCGATCCTTTTCGG-3 ' (SEQ ID NO: 61) in a 
1 5 polymerase chain reaction (PCR). 

Each resulting PCR product was subcloned into pCR2.1Topo (Invitrogen) and labeled 
pCGN9964 (LCAT1), pCGN9985 (LCAT2), pCGN9965 (LCAT3), pCGN9995 (LCAT4), 
pCGN10964 (LCAT5), pCGN10963 (LROI), and pCGN8626 (ACAT). Double stranded 
DNA sequence was obtained to verify that no errors were introduced by the PCR 
20 amplification. 

4A. Baculovirus Expression Constructs 

Constructs are prepared to direct the expression of the Arabidopsis LCAT and yeast 
25 LCAT sequences in cultured insect cells. The entire coding region of the LCAT proteins was 
removed from the respective constructs by digestion with Noil and &e8387I, followed by gel 
electrophoresis and gel purification. The fragments containing the LCAT coding sequences 
were cloned into Notl and Pstl digested baculovirus expression vector pFastBacl (Gibco- 
BRL, Gaithersburg, MD). The resulting baculovirus expression constructs were referred to as 
30 pCGN9992 (LCAT1), pCGN9993 (LCAT2), pCGN9994 (LCAT3), pCGN10900 (LCAT4), 
pCGN10967 (LCAT5), and pCGN10962 (LROI). 
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4B. Plant Expression Construct Preparation 

A plasmid containing the napin cassette derived from pCGN3223 (described in U.S. 
Patent No. 5,639,790, the entirety of which is incorporated herein by reference) was modified 

5 to make it more useful for cloning large DNA fragments containing multiple restriction sites, 
and to allow the cloning of multiple napin fusion genes into plant binary transformation 
vectors. An adapter comprised of the self annealed oligonucleotide of sequence 5'- 
CGCGATTTAAATGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCATTTAA 
AT-3' (SEQ ID NO:62) was ligated into the cloning vector pBC SK+ (Stratagene) after 

1 0 digestion with the restriction endonuclease BssHH to construct vector pCGN7765 . Plamids 
pCGN3223 and pCGN7765 were digested with NotI and ligated together. The resultant 
vector, pCGN7770, contained the pCGN7765 backbone with the napin seed specific 
expression cassette from pCGN3223. 

The cloning cassette, pCGN7787, contained essentially the same regulatory elements 

1 5 as pCGN7770, with the exception of the napin regulatory regions of pCGN7770 have been 
replaced with the double CAMV 35S promoter and the tail polyadenylation and 
transcriptional termination region. 

A binary vector for plant transformation, pCGN5 139, was constructed from 
pCGN1558 (McBride and Summerfelt, (1990) Plant Molecular Biology, 14:269-276). In 

20 pCGN5 139, the polylinker of pCGNl 558 was replaced as a Hindin/Asp71 8 fragment with a 
polylinker containing unique restriction endonuclease sites, AscI, Pad, Xbal, Swal, 
BamHI,and Not! The Asp718 and HindTA restriction endonuclease sites are retained in 
pCGN5139. 

A series of turbo binary vectors was constructed to allow for the rapid cloning of 
25 DNA sequences into binary vectors containing transcriptional initiation regions (promoters) 
and transcriptional termination regions. 

The plasmid pCGN8618 was constructed by ligating oligonucleotides 
5 '-TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3 ' (SEQIDNO:63) and 
5 '-TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3 ' (SEQ ID NO:64) into 
30 Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker and 
napin 3' region was excised from pCGN8618 by digestion with Asp718I; the fragment was 
blunt-ended by filling in the 5' overhangs with Klenow fragment then ligated into pCGN5139 
that had been digested with Asp718I and Hindm and blunt-ended by filling in the 5' 
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overhangs with Klenow fragment. A plasmid containing the insert oriented so that the napin 
promoter was closest to the blunted Asp718I site of pCGN5139 and the napin 3' was closest 
to the blunted HindHI site was subjected to sequence analysis to confirm both the insert 
orientation and the integrity of cloning junctions. The resulting plasmid was designated 
pCGN8622. >- 

The plasmid pCGN8619 was constructed by ligating oligonucleotides 
5 '-TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3 ' (SEQIDNO:65) and 
5 '-TCGAGGATCCGCGGCCGCAAGCTTCCTGC AGG-3 ' (SEQIDNO:66) into 
Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker and 
napin 3' region was removed from pCGN8619 by digestion with Asp718I; the fragment was 
blunt-ended by filling in the 5' overhangs with Klenow fragment then ligated into pCGN5139 
that had been digested with Asp718I and Hindm and blunt-ended by filling in the 5' 
overhangs with Klenow fragment. A plasmid containing the insert oriented so that the napin 
promoter was closest to the blunted Asp718I site of pCGN5139 and the napin 3' was closest 
to the blunted HindHI site was subjected to sequence analysis to confirm both the insert 
orientation and the integrity of cloning junctions. The resulting plasmid was designated 
pCGN8623. 

The plasmid pCGN8620 was constructed by ligating oligonucleotides 
5'-TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGGAGCT -3' (SEQ ID NO:67) and 
5'-CCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:68) into Sall/Sacl- 
digested pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' region 
was removed from pCGN8620 by complete digestion with Asp718I and partial digestion with 
Not! The fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment 
then ligated into pCGN5 139 that had been digested with Asp71 81 and Hindm and blunt- 
ended by filling in the 5' overhangs with Klenow fragment. A plasmid containing the insert 
oriented so that the d35S promoter was closest to the blunted Asp718I site of pCGN5139 and 
the tml 3' was closest to the blunted HindHI site was subjected to sequence analysis to 
confirm both the insert orientation and the integrity of cloning junctions. The resulting 
plasmid was designated pCGN8624. 

The plasmid pCGN8621 was constructed by ligating oligonucleotides 
5 '-TCGACCTGC AGGAAGCTTGCGGCCGCGGATCCAGCT-3 ' (SEQIDNO:69) and 
5'-GGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQIDNO:70) into Sall/Sacl- 
digested pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' region 
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was removed from pCGN8621 by complete digestion with Asp718I and partial digestion with 
NotL The fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment 
then ligated into pCGN5 139 that had been digested with Asp71 81 and HindlH and blunt- 
ended by filling in the 5' overhangs with Klenow fragment. A plasmid containing the insert 
oriented so that the d35S promoter was closest to the blunted Asp718I site of pCGN5139 and 
the tml 3' was closest to the blunted HindHI site was subjected to sequence analysis to 
confirm both the insert orientation and the integrity of cloning junctions. The resulting 
plasmid was designated pCGN8625. 

The plasmid construct pCGN8640 is a modification of pCGN8624 described above. 
A 938bp PstI fragment isolated from transposon Tn7 which encodes bacterial spectinomycin 
and streptomycin resistance (Fling et al. (1985), Nucleic Acids Research 13(19):7095-7106), 
a determinant for E. coli and Agrobacterium selection, was blunt ended with Pfu polymerase. 
The blunt ended fragment was ligated into pCGN8624 that had been digested with Spel and 
blunt ended with Pfu polymerase. The region containing the PstI fragment was sequenced to 
confirm both the insert orientation and the integrity of cloning junctions. 

The spectinomycin resistance marker was introduced into pCGN8622 and pCGN8623 
as follows. A 7.7 Kbp AvrE-SnaBI fragment from pCGN8640 was ligated to a 10.9 Kbp 
AvrH-SnaBI fragment from pCGN8623 or pCGN8622, described above. The resulting 
plasmids were pCGN8641 and pCGN8643, respectively. 

The plasmid pCGN8644 was constructed by ligating oligonucleotides 
5 ' -GATC ACCTGC AGGAAGCTTGCGGCCGCGGATCC AATGC A-3 ' (SEQ ID NO:71) 
and 5'-TTGGATCCGCGGCCGCAAGCTTCCTGCAGGT-3' (SEQ ID NO:72) into BamHI- 
Pstl digested pCGN8640. 

4C. Plant LCAT Expression Construct Preparation 

The coding sequence of LCAT1 was cloned from pCGN9964 as a NotV &e8387I 
fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the expression 
constructs pCGN9960, pCGN9961, pCGN9962, and pCGN9963, respectively. The construct 
pCGN9960 was designed to express the LCAT1 coding sequence in the sense orientation 
from the constitutive promoter CaMV 35S. The construct pCGN9961 was designed to 
express the LCAT1 coding sequence in the antisense orientation from the napin promoter. 
The construct pCGN9962 was designed to express the LCAT1 coding sequence in the sense 
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orientation from the napin promoter. The construct pCGN9963 was designed to express the 
LCAT1 coding sequence in the antisense orientation from the constitutive promoter CaMV 
35S. 

The coding sequence of LCAT2 was cloned from pCGN9985 as a NotV SseS3Zll 
fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the expression 
constructs pCGN9981, pCGN9982, pCGN9983, and pCGN9984, respectively. The construct 
pCGN9981 was designed to express the LCAT2 coding sequence in the sense orientation 
from the constitutive promoter CaMV 35S. The construct pCGN9982 was designed to 
express the LC AT2 coding sequence in the antisense orientation from the napin promoter. 
The construct pCGN9983 was designed to express the LCAT2 coding sequence in the sense 
orientation from the napin promoter. The construct pCGN9984 was designed to express the 
LCAT2 coding sequence in the antisense orientation from the constitutive promoter CaMV 
35S. 

The coding sequence of LCAT3 was cloned from pCGN9965 as a NotV &e8387I 
fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the expression 
constructs pCGN9966, pCGN9967, pCGN9968, and pCGN9969, respectively. The construct 
pCGN9966 was designed to express the LCAT3 coding sequence in the sense orientation 
from the constitutive promoter CaMV 35S. The construct pCGN9967 was designed to 
express the LCAT3 coding sequence in the antisense orientation from the napin promoter. 
The construct pCGN9968 was designed to express the LCAT3 coding sequence in the sense 
orientation from the napin promoter. The construct pCGN9969 was designed to express the 
LCAT3 coding sequence in the antisense orientation from the constitutive promoter CaMV 
35S. 

The coding sequence of LCAT4 was cloned from pCGN9995 as a NotV &e8387I 
fragment into pCGN8640, pCGN8641, pCGN8643, and pCGN8644 to create the expression 
constructs pCGN9996, pCGN9997, pCGN9998, and pCGN9999, respectively. The construct 
pCGN9996 was designed to express the LCAT4 coding sequence in the sense orientation 
from the constitutive promoter CaMV 35S. The construct pCGN9997 was designed to 
express the LCAT4 coding sequence in the antisense orientation from the napin promoter. 
The construct pCGN9998 was designed to express the LCAT4 coding sequence in the sense 
orientation from the napin promoter. The construct pCGN9999 was designed to express the 
LCAT4 coding sequence in the antisense orientation from the constitutive promoter CaMV 
35S. 
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The coding sequence of LCAT5 was cloned from pCGN10964 as a No/I/ &«8387I 
fragment into pCGN9977 and pCGN9979, to create the expression constructs pCGN10965, 
and pCGN10966, respectively. The construct pCGN10965 was designed to express the 
LCAT5 coding sequence in the sense orientation from the constitutive promoter CaMV 35S. 
The construct pCGN10966 was designed to^xpress the LCAT5 coding sequence in the sense 
orientation from the napin promoter. 

The coding sequence of LROl was cloned from pCGN10963 as a NotV &e8387I 
fragment into pCGN9977 and pCGN9979, to create the expression constructs pCGN10960, 
and pCGN10961, respectively. The construct pCGN10960 was designed to express the 
LROl coding sequence in the sense orientation from the constitutive promoter CaMV 35S. 
The construct pCGN10961 was designed to express the LROl coding sequence in the sense 
orientation from the napin promoter. 

4D. Plant ACAT Expression Construct Preparation 

A fragment containing the Arabidopsis ACAT-like coding region was removed from 
pCGN8626 by digestion with Sse8387I and Not I. The fragment containing the ACAT-like 
sequence was ligated into Pstl-Not I digested pCGN8622. The resulting plasmid was 
designated pCGN8627. DNA sequence analysis confirmed the integrity of the cloning 
junctions. 

A fragment containing the Arabidopsis ACAT-like coding region (SEQ ID NO: 42) 
was removed from pCGN8626 by digestion with Sse8387I and Not I. The fragment was 
ligated into Pstl-Not I digested pCGN8623. The resulting plasmid was designated 
pCGN8628. DNA sequence analysis corifirmed the integrity of the cloning junctions. 

A fragment containing the Arabidopsis ACAT-like coding region was removed from 
pCGN8626 by digestion with Sse8387 and Not I. The fragment was ligated into Pstl-Not I 
digested pCGN8624. The resulting plasmid was designated pCGN8629. DNA sequence 
analysis confirmed the integrity of the cloning junctions. 

A fragment containing the Arabidopsis ACAT-like coding region was removed from 
pCGN8626 by digestion with Sse8387 and Not I. The fragment was ligated into Pstl-Not I 
digested pCGN8625. The resulting plasmid was designated pCGN8630. DNA sequence 
analysis confirmed the integrity of the cloning junctions. 
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An additional expression construct for the suppression of endogenous ACAT-like 
activity was also prepared. The construct pCGN8660 was constructed by cloning 
approximately 1 Kb of the Arabidopsis ACAT-like coding region from pCGN8626 in the 
sense orientation, and the full-length Arabidopsis ACAT-like coding region in the antisense 
orientation under the regulatory control of the napin transcription initiation sequence. 

For expression of the rat ACAT-like sequence in plants, the NotI-Sse8387I fragment 
of pCGN8592 was cloned into Notl-PstI digested binary vectors pCGN8621, pCGN8622, and 
P CGN8624 to yield plasmids, pCGN 9700, pCGN9701, and pCGN9702, respectively. 
Plasmid pCGN9700 expresses a sense transcript of the rat ACAT-like cDNA under control of 
a napin promoter, plasmid pCGN9701 expresses an antisense transcript of the rat ACAT-like 
cDNA under control of a napin promoter, and plasmid pCGN9702 expresses a sense 
transcript of the rat ACAT-like cDNA under control of a double 35S promoter. Plasmids 
pCGN 9700, pCGN9701, and pCGN9702 were introduced in Agrobacterium tumefaciens 
EHA101. 

Constructs were prepared to direct the expression of the rat ACAT-like sequence in 
the seed embryo of soybean and the endosperm of corn. For expression of the rat ACAT-like 
DNA sequence in soybean, a 1.5 kb NotVSse&ZXll fragment from pCGN8592 containing the 
coding sequence of the rat ACAT-like sequence was blunt ended using Mung bean nuclease, 
and ligated into the Smal site of the turbo 7S binary/cloning vector pCGN8809 to create the 
vector pCGN8817 for transformation into soybean by particle bombardment. The vector 
pCGN8817 contained the operably linked components of the promoter region of the soybean 
a' subunit of p-conglycinin (7S promoter, (Chen et al, (1986), Proc. Natl. Acad. Set, 
83:8560-8564), the DNA sequence coding for the entire rat ACAT-like protein, and the 
transcriptional termination region of pea RuBisCo small subunit, referred to as E9 3' 
(Coruzzi, et al (1984) EMBOJ. 3:1671-1679 and Morelli, et al (1985) Nature 315:200-204). 
This construct further contained sequences for the selection of positive transformed plants by 
screening for resistance to glyphosate using the CP4 EPSPS (U.S. Patent 5,633,435) 
expressed under the control of the figwort mosaic virus (FMV) promoter (U.S. Patent 
Number 5,378,619) and the transcriptional termination region of E9. 

For expression of the rat ACAT-like sequence in the corn endosperm, a 1.5 kb 
NotySseS3STL fragment from pCGN8592 containing the coding sequence of the rat ACAT- 
like sequence was blunt ended using Mung bean nuclease, and ligated into the BamHl site of 
the rice pGtl expression cassette pCGN8592 for expression from the pGtl promoter (Leisy, 
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DJ. et al., Plant Mol. Biol. 14 (1989) 41-50) and the HSP70 intron sequence (U.S. Patent 
Number 5,593,874). This cassette also included the transcriptional termination region 
downstream of the cloning site of nopaline synthase, nos 3' (Depicker et al., J. Molec. Appl. 
Genet. (1982) 1 : 562-573). A 7.5 kb fragment containing the pGtl promoter, the DNA 
5 sequence encoding the rat ACAT-like protejp, and the nos transcriptional termination 

sequence was cloned into the binary vector pCGN8816 to create the vector pCGN88 1 8 for 
transformation into corn. This construct also contained sequences for the selection of positive 
transformants with kanamycin using the kanarnycin resistance gene from Tn5 bacteria under 
the control of the CAMV 35S promoter and tml transcriptional termination regions. 

10 

Example 5: Expression in Insect Cell Culture 

A baculovirus expression system was used to express the LCAT cDNAs in cultured 
insect cells. 

1 5 The baculovirus expression constructs pCGN9992, pCGN9993, pCGN9994, 

pCGN10900, pCGN10962, and pCGN10967 were transformed and expressed using the 
BAC-to-BAC Baculovirus Expression System (Gibco-BRL, Gaithersburg, MD) according to 
the manufacturer's directions. 

The transformed insect cells were used to assay for acyltransferase activities using 

20 methods known in the art (see Example 8). 

Example 6: Plant Transformation 

A variety of methods have been developed to insert a DNA sequence of interest into the 
25 genome of a plant host to obtain the transcription or transcription and translation of the sequence 
to effect phenotypic changes. Transgenic plants were obtained by Agrobacterium-mQdinted 
transformation as described by Radke et al. (Theor. Appl. Genet. (1988) 75:685-694; Plant Cell 
Reports (1992) ii:499-505). Alternatively, microprojectile bombardment methods, such as 
described by Klein et al. (Bio/Technology 10:286-291) may also be used to obtain nuclear 
30 transformed plants. Other plant species may be similarly transformed using related techniques. 

The plant binary constructs described above were used in plant transformation to direct 
the expression of the sterol acyltransferases in plant tissues. Binary vector constructs were 
transformed into strain EHA101 Agrobacterium cells (Hood et al., J. Bacterial (1986) 1 68: 1291- 
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1301), by the method of Holsters etal. (Mol. Gen. Genet. (1978) 765:181-187). Transgenic 
Arabidopsis thaliana plants were obtained by Agrobacterium-mediaXQd transformation as 
described by Valverkens et al., (Proc. Nat. Acad. Sci. (1988) §5:5536-5540), Bent et al. ((1994), 
Science 265:1856-1860), and Bechtold et al. ((1993), C. R.Acad. Sci, Life Sciences 316:1 194- 
5 1199). > 

EXAMPLE 7: Plant Assays for Modified Sterol Content/Profile 

7a: NMRofT2seed 

10 Seed from plants expressing LCAT 1 through 4 under the control of the napin 

promoter were analyzed by NMR. Arabidopsis seeds from transgenic plants were placed 
directly into wide-mouth MAS NMR sample tubes. 

High-resolution spectra were measured at 11.7 T (1H=500 MHz, 13C=125 mHz) 
using Varian NMR Instruments (Palo Alto, CA) Inova™ NMR spectrometers equipped with 

15 carbon-observe MAS Nanoprobes™. The 13C spectra were acquired without a field- 
frequency lock at ambient temperature (approx. 21-22°C) for 14 hours using the following 
conditions: spectral width = 29.996 kHz, acquisition time = 2.185 seconds, p/2 pulse (3.8 ms) 
with no relaxation delay, 1H g B2 = 2.5 kHz with Waltz decoupling. Data processing 
conditions were typically: digital resolution = 0.1 1 Hz, 0.3 to 1.5 Hz line broadening and 

20 time-reversed linear prediction of the first three data points. Chemical shifts were referenced 
by adding neat tetramethylsilane (TMS) to Arabidopsis seeds and using the resulting 
referencing parameters for subsequent spectra. The 13C resolution was 2-3 Hz for the most 
narrow seed resonances. Spectral resolution was independent of MAS spinning speeds (1.5- 
3.5 kHz) and data were typically obtained with 1.5 kHz spinning speeds. Spinning sidebands 

25 were approx. 1% of the main resonance. Phytosterol 1 3C assignments were based on model 
samples composed of triolein, P-sitosterol and cholesterol oleate. Triacylglycerol 13C 
assignments were made from comparison with literature assignments or with shifts computed 
from a 13C NMR database (Advanced Chemical Development, Inc., version 3.50, Toronto 
Canada). 

30 The results of these analyses are displayed in Figure 2 and show that there was a trend 

of an approximately 2 fold increase of phytosterols in the seeds derived from plant line 5 
expressing the LCAT 4 gene (pCGN9998) under the control of the napin promoter. During 
the course of this analysis it was also noted that the average oil content of seed from plants 
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expressing the LCAT2 construct (pCGN9983) under the control of the napin promoter was 
higher than that of controls. This is the first in planta evidence supporting the concept that 
overexpression of a nucleotide sequence encoding a lecithinxholesterol acyltransferase-like 
polypeptide can increase oil content. 
5 > 
7b: HPLC/MS of T2 seed 

Seed oil from T2 plants expressing LCAT1 through 4 under the control of the napin 
promoter was extracted using an accelerated solvent extractor (ASE) method. Seed samples 
were ground, using a mortar and pestle, to achieve a fine homogeneous meal. Oil was 
10 obtained using a Dionex Accelerated Solvent Extractor (ASE). Clean ground seed was added 
to an equal amount of diatomaceous earth. The ground seed sample and the diatomaceous 
earth were thoroughly mixed until a homogeneous texture was achieved. The sample was 
then loaded into the instrument and oil extraction was achieved using hexane under validated 
laboratory protocols. 

15 Oil from these seed samples was then analyzed for sterol ester analysis using 

HPLC/MS for free campesterol, stigmasterol, and sitosterol and their fatty acid esters. To the 
autosampler vial containing approximately 0.1 grams oil was added 0.3 mLs CDC1 3 . One- 
hundred microliters of this solution was added to 900 microliters CHC1 3 . Five microliters of 
this diluted sample was subsequently injected into an HPLC/MS with positive ion 

20 atmospheric pressure ionization. The individual components in the oils were separated using 
two 4.6 x 50 mm C 8 Zorbax columns in series and a gradient using acetonitrile and 
acetonitrile with 40% CHC1 3 . The sterol concentrations were calculated assuming each sterol 
and its fatty acids have the same molar responses. This was observed to be the case with 
cholesterol and its esters and was assumed to be the case for campesterol, stigmasterol, and 

25 sitosterol. In the present study, the sterol identified as stigmasterol was actually an isomer of 
this compound. 

The results of these analyses are displayed in Figures 3 and 4 and show that there 
were sterol ester enhancements on the order of 50%. in the seeds derived from six out of 
seven T2 plant lines expressing LCAT3 (pCGN9968) under the control of the napin 
30 promoter. 
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EXAMPLE 8: Baculo virus Insect Cell Culture for Sterol Esterification Activity 

Baculovirus expression construct pCGN9992, pCGN9993, pCGN9994 and 
pCGN10900 (see Example 4) were transformed and expressed using the BAC-TOBAC 
5 Baculovirus Expression System (Gibco-BRL, Gaithersburg, MD) according to the 

manufacturer's instructions except harvesting of recombinant viruses was done 5 days post- 
transfection. The supernatant from the transfection mixture was used for generating virus 
stock which in turn was used for infecting Sf9 cells used in the assay. 

The transformed cells were assayed for lecithin: sterol acyltransferase activities using 

10 the method described herein. Insect cells were centrifuged and the resulting cell pellet was 
either used immediately or stored at -70 C for later analysis. Cells were resuspended in 
Medium A (100 mM Tricine/NaOH, pH 7.8, 10% (w/v) glycerol, 280 mM NaCl with : 0.1 
uM Aprotinin, 1 \xM Leupeptin, and 100 uM Pefabloc (all from Boehringer Mannheim, 
Germany) and lysed by sonication (2x10 sec). Cell walls and other debris were pelleted by 

15 centrifugation (14,000 x g , 10 min, 4°C). The supernatant was transferred to a new vial and 
membranes pelleted by centrifugation (100,000 x g, Ti 70.1 rotor, 46,000 rpm for 1 hour at 
4°C). Total membranes were resuspended in Medium A. Lecithimsterol acyltransferase 
activity was assayed in a 0.1 ml reaction mixture containing 100 mM Tris/HCl, pH 7, 28 mM 
NaCl, 0.03% Triton X-100, 0.1 mM sitosterol, 20 uM l,2-[ 14 C]-palmitoyl-phosphatidyl 

20 choline (246420 dpm/nmole), and 0.05-20 mg of membrane protein. After 15 minutes at 30 
°C, the reaction was terminated by addition of a 0.5 ml solution of methylene 
chloride:methanol (4:1, v/v ) containing 100 jag cholesterol and cholesterol ester as cold 
carriers. A portion (0.1 ml) of the bottom organic layer was removed and evaporated under 
nitrogen gas. The concentrated extract was resuspended in 30 p.1 of hexane and spotted onto a 

25 silica gel-G thin layer chromatographic plate. The plate was migrated in hexane:diethyl 
ether: acetic acid (80:20:1) to the top, then air dried. Radioactivity was determined by 
exposure to a Low Energy Phosphor-imaging Screen. Following exposure, the screen was 
read on a phosphorimager. 

The LCAT 4 protein from pCGN10900 in baculovirus membranes showed a 

30 radioactive spot in the region of the TLC plate where cholesterol ester migrates indicating 
that LCAT 4 has the ability to catalyze the transfer of an acyl group from lecithin (PC) to 
sitosterol to make a sitosterol ester. 
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EXAMPLE 9: Plant Assay for Modified Lipid Content 

Nir (near infrared spectroscopy spectral scanning) can be used to determine the total 
oil content of Arabidopsis seeds in a non-destructive way provided that a spectral calibration 
curve has been developed and validated for .seed oil content. A seed oil spectral calibartion 
curve was developed using seed samples from 85 Arabidopsis plants. Seed was cleaned and 
scanned using a Foss NIR model 6500 (Foss-Nirs Systems, Inc.). Approximately 50 to 100 
milligrams of whole seeds, per sample, were packed in a mini sample ring cup with quartz 
lens [ IH-0307 ] consisting a mini-insert [ IH-0337 ] and scanned in reflectance mode to 
obtain the spectral data. The seed samples were then ground, using a mortar and pestle, to 
achieve a fine homogeneous meal. The ground samples were measured for oil using an 
accelerated solvent extractor (ASE). 

Measurement for the total oil content was performed on the Dionex Accelerated 
Solvent Extractor (ASE). Approximately 500 mg of clean ground seed was weighed to the 
nearest 0.1 mg onto a 9 x 9 cm weigh boat. An equal amount of diatomaceous earth was 
added using a top-loading balance accurate to the nearest 0.01 g. The ground seed sample 
and the diatomaceous earth were thoroughly mixed until a homogeneous texture was 
achieved. The sample was loaded on to the instrument and oil extraction was achieved using 
hexane under validated laboratory protocols. Standard Rapeseed samples were obtained from 
the Community Bureau of Reference (BCR). The ASE extraction method was validated 
using the BCR reference standards. A total percent oil recovery of 99% to 100% was 
achieved. "As-is" oil content was calculated to the nearest 0.01 mass percentage using the 
formula: 

Oil Content = 100% x (vial plus extracted oil wt - initial vial wt) / (sample wt) 

The analytical data generated by ASE were used to perform spectral calibrations. Nir 
calibration equations were generated using the built-in statistical package within the 
NirSytems winisi software. The spectral calibration portion of the software is capable of 
calibration and self-validation. From a total of 85 samples, 57 samples were used to generate 
the total percent oil calibration. The remaining samples were used to validate the oil 
calibrations. Optimized smoothing, derivative size, and mathematical treatment (modified 
partial least square) was utilized to generate the calibration. The samples that were not used 
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in building respective calibrations were used as a validation set. Statistical tools such as 
correlation coefficient ( R ), coefficient of determination (R 2 ), standard error of prediction ( 
SEP ), and the standard error of prediction corrected for bias (SEPC) were used to evaluate 
the calibration equations. 
5 T2 seeds from plants that had been transformed with the LCAT genes were cleaned 

and scanned using a Foss NIR model 6500 (Foss-Nirs Systems, Inc.). Approximately 50 to 
100 milligrams of whole seeds, per sample, were packed in a mini sample ring cup with 
quartz lens [ IH-0307 ] consisting a rnini-insert [ IH-0337 ] and scanned in reflectance mode 
to obtain the spectral data. Oil percentage in each seed sample was determined using the seed 

10 oil spectral calibration curve detailed above. 

The results of these analyses are found in Figure 5 and Table 2 and show that there 
was a significant increase in the oil level in seed from T2 plants expressing the LCAT2 gene. 
This increase in oil was seen in plants when LCAT2 was driven by either the 35S constitutive 
promoter or the seed-specific napin promoter. These results show that overexpression of a 

15 nucleic acid sequence encoding a Iecithinxholesterol acyltransferase-like polypeptide can 
increase seed oil production in plants. 
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Table 2 





Construct number 


Seed Oil Percentage (%) 


CONTROL 




24.7 


CONTROL 




28.0 


CONTROL 


r 


31.8 


CONTROL 




32.4 


NAPIN LCAT1 


PCGN9962 


28.5 


NAPIN LCAT1 


PCGN9962 


28.9 


NAPIN LCAT1 


PCGN9962 


29.6 


NAPIN LCAT1 


PCGN9962 


30.1 


NAPIN LCAT1 


PCGN9962 


30.1 


NAPIN LCAT1 


PCGN9962 


30.1 


NAPIN LCAT1 


PCGN9962 


30.8 


NAPIN LCAT1 


PCGN9962 


31.0 


NAPIN LCAT1 


pCGN9962 


32.1 


NAPIN LCAT1 


pCGN9962 


34.2 


NAPIN LCAT3 


pCGN9968 


26.8 


NAPIN LCAT3 


pCGN9968 


27.4 


NAPIN LCAT3 


pCGN9968 


29.0 


NAPIN LCAT3 


pCGN9968 


29.0 


NAPIN LCAT3 


pCGN9968 


32.6 


NAPIN LCAT2 


pCGN9983 


26.5 


NAPIN LCAT2 


pCGN9983 


34.7 


NAPIN LCAT2 


pCGN9983 


34.8 


NAPIN LCAT2 


pCGN9983 


35.7 


NAPIN LCAT2 


pCGN9983 


35.8 


NAPIN LCAT2 


pCGN9983 


36.3 


NAPIN LCAT2 


pCGN9983 


36.7 


NAPIN LCAT2 


pCGN9983 


37.0 


NAPIN LCAT2 


pCGN9983 


37.2 


NAPIN LCAT2 


pCGN9983 


37.3 


NAPIN LCAT2 


pCGN9983 


37.3 


NAPIN LCAT2 


pCGN9983 


37.4 


NAPIN LCAT2 


pCGN9983 


37.8 


NAPIN LCAT2 


pCGN9983 


38.0 


NAPIN LCAT2 


pCGN9983 


38.0 


35S LCAT2 


pCGN9981 


27.3 


35 S LCAT2 


pCGN9981 


28.1 


35S LCAT2 


pCGN9981 


28.2 


35S LCAT2 


pCGN9981 


28.6 


35 S LCAT2 


pCGN9981 


29.8 


35S LCAT2 


pCGN9981 


30.3 


35S LCAT2 


pCGN9981 


32.4 


35S LCAT2 


pCGN9981 


32.5 


35S LCAT2 


pCGN9981 


33.6 


35S LCAT2 


P CGN9981 


34.1 


35S LCAT2 


pCGN9981 


35.5 


35S LCAT2 


pCGN9981 


36.4 
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35 S LCAT2 


pCGN9981 


37.1 


35S LCAT2 


pCGN9981 


38.3 


35S LCAT2 


pCGN9981 


38.5 


35S LCAT2 


pCGN9981 


39.1 



In light of the detailed description of the invention and the examples presented above, 
it can be appreciated that the several aspects of the invention are achieved. 

5 It is to be understood that the present invention has been described in detail by way of 

illustration and example in order to acquaint others skilled in the art with the invention, its 
principles, and its practical application. Particular formulations and processes of the present 
invention are not limited to the descriptions of the specific embodiments presented, but rather 
the descriptions and examples should be viewed in terms of the claims that follow and their 

10 equivalents. While some of the examples and descriptions above include some conclusions 
about the way the invention may function, the inventors do not intend to be bound by those 
conclusions and functions, but put them forth only as possible explanations. 

It is to be further understood that the specific embodiments of the present invention as 
set forth are not intended as being exhaustive or limiting of the invention, and that many 

1 5 alternatives, modifications, and variations will be apparent to those of ordinary skill in the art 
in light of the foregoing examples and detailed description. Accordingly, this invention is 
intended to embrace all such alternatives, modifications, and variations that fall within the 
spirit and scope of the following claims. 



What is claimed is: 
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1 . An isolated nucleic acid sequence comprising a polynucleotide encoding a plant 
lecithinxholesterol acyltransferase-like polypeptide or fragment thereof. 

2. The isolated nucleic acid sequence of claim 1, wherein said plant lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of Arabidopsis, 
soybean and corn. 

3. An isolated nucleic acid sequence comprising a polynucleotide encoding a plant acyl 
CoAxholesterol acyltransferase-like polypeptide. 

4. The isolated nucleic acid sequence of claim 3, wherein said polynucleotide is SEQ ID 
NO: 42 or degenerate variants thereof. 

5 . The isolated nucleic acid sequence of claim 1 , wherein said polynucleotide is selected 
from the group consisting of SEQ ID NO: 2, 4, 6, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 
19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 43, 44, 45, 46, 47, 48, 49, 50, 51, 73 and 75 
or degenerate variants thereof. 

6. An isolated nucleic acid sequence consisting essentially of SEQ ID NO: 2, 4, 6, 8, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 42, 43, 44, 45, 
46, 47, 48, 49, 50, 51 73 or 75. 

7. An isolated nucleic acid sequence consisting of SEQ ID NO: 2, 4, 6, 8, 10, 11, 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 42, 43, 44, 45, 46, 47, 48, 
49, 50, 51 73 or 75. 

8. An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO 3 or SEQ ID 
NO 3 with at least one conservative amino acid substitution; 
5 b) SEQ ID NO: 2; 

c) an isolated polynucleotide that has at least 70% sequence identity to SEQ ID 
NO: 2; 
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d) an isolated polynucleotide that has at least 80% sequence identity to SEQ ID 
NO: 2; 

e) an isolated polynucleotide that has at least 90% sequence identity to SEQ ID 
NO: 2; 

f) an isolated polynucleotide that has at least 95% sequence identity to SEQ ID 
NO: 2; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 2; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 2 and encodes a plant lecithin: cholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5* X-(Ri) n -(R 2 ) n -(R 3 ) n -Y 3', where X is hydrogen, Y is hydrogen or a metal, 
Rj and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 3 or SEQ 
ID NO: 3 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 2; 

c) an isolated polynucleotide that has at least 70% sequence identity to SEQ ID 
NO: 2; 

d) an isolated polynucleotide that has at least 80% sequence identity to SEQ ED 
NO: 2; 

e) an isolated polynucleotide that has at least 90% sequence identity to SEQ ED 
NO: 2; 

f) an isolated polynucleotide that has at least 95% sequence identity to SEQ ED 
NO: 2; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 2; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 
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i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 2 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 5 or SEQ 
ID NO: 5 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 4; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 4; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 4; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 4; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 4; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 4; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 4 and encodes a plant lecithin:cholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-(R 1 ) n -(R 2 ) n -(R 3 ) Il -Y 3', where X is hydrogen, Y is hydrogen or a metal, 
Rj and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 5 or SEQ 
ID NO: 5 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 4; 
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c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 4; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 4; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 4; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 4; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 4; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 4 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 7 or SEQ 
ID NO: 7 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 6; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 6; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 6; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 6; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 6; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 6; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e),(f)or(g);and 
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i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 6 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-^X-^X-^X-Y 3\ where X is hydrogen, Y is hydrogen or a metal, 
Rj and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 7 or SEQ 
ID NO: 7 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 6; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 6; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO:6; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 6; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 6; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 6; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 6 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 9 or SEQ ID 
NO: 9 with at least one conservative amino acid substitution; 

b) SEQ ID NO 8; 
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c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 8; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 8; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 8; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 8; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 8; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), (d), 
(e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ ID 
NO: 8 and encodes a plant lecithinxholesterol acyltransferase-like polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-CR^-CRA-CRaVY 3', where X is hydrogen, Y is hydrogen or a metal, 
Rj and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 9 or SEQ 
ID NO: 9 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 8; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 8; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 8; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 8; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 8; 

g) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 8; 
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h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 8 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 74 or SEQ 
ID NO: 74 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 73; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 73; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 73; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 73; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 73; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 73; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 73 and encodes a plant lecithin: cholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-(R,)„-<R 2 ) n -(R3) ll -Y 3', where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 74 or SEQ 
ID NO: 74 with at least one conservative amino acid substitution; 
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b) SEQ ID NO: 73; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 73; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 73; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 73; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ED 
NO: 73; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ED NO: 73; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ED NO: 73 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

A isolated nucleic acid sequence comprising a polynucleotide selected from the group 
consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ED NO: 76 or SEQ 
ED NO: 76 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 75; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ED 
NO: 75; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ED 
NO: 75; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ED 
NO: 75; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ED 
NO: 75; ~* 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ED NO: 75; 
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h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d),(e), (f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 75 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-^-^n-^-Y 3', where X is hydrogen, Y is hydrogen or a metal, 
R, and R 3 are any nucleic acid, n is an integer between 0-3000, and R 2 is selected from 
the group consisting of: 

a) an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 76 or SEQ 
ID NO: 76 with at least one conservative amino acid substitution; 

b) SEQ ID NO: 75; 

c) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 75; 

d) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 75; 

e) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 75; 

f) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 75; 

g) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 75; 

h) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e),(f)or(g);and 

i) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ED NO: 75 and encodes a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence comprising a polynucleotide selected from the 
group consisting of: 

a) SEQ ID NO: 42 or a degenerate variant thereof; 
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b) an isolated polynucleotide having at least 70% sequence identity with SEQ ID 
NO: 42; 

c) an isolated polynucleotide having at least 80% sequence identity with SEQ ID 
NO: 42; 

d) an isolated polynucleotide having at least 90% sequence identity with SEQ ID 
NO: 42; 

e) an isolated polynucleotide having at least 95% sequence identity with SEQ ID 
NO: 42; 

f) an isolated polynucleotide of at least 1 0 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 42; 

g) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), or (f); and 

h) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 42 and encodes an acyl CoAxholesterol acyltransferase-like 
polypeptide. 

An isolated nucleic acid sequence consisting essentially of a polynucleotide of the 
formula 5' X-CRO^CR^-^^-Y 3', where X is hydrogen, Y is hydrogen or a metal Rj 
and R 3 are any nucleic acid, n is an integer between 0 and 3000, and R 2 is selected 
from the group consisting of: 

a) SEQ ID NO: 42 or degenerate variants thereof; 

b) an isolated polynucleotide having at least 70% sequence identity to SEQ ID 
NO: 42; 

c) an isolated polynucleotide having at least 80% sequence identity to SEQ ID 
NO: 42; 

d) an isolated polynucleotide having at least 90% sequence identity to SEQ ID 
NO: 42; 

e) an isolated polynucleotide having at least 95% sequence identity to SEQ ID 
NO: 42; 

f) an isolated polynucleotide of at least 10 nucleic acids that hybridizes under 
stringent conditions to SEQ ID NO: 42; 

g) an isolated polynucleotide complementary to a polynucleotide of (a), (b), (c), 
(d), (e), or (f); and 
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h) an isolated polynucleotide that hybridizes under stringent conditions to SEQ 
ID NO: 42 and encodes an acyl CoAxholesterol acyltransferase-like 
20 polypeptide. 

22. A recombinant nucleic acid construct comprising a regulatory sequence operably 
linked to polynucleotide encoding a lecithinxholesterol acyltransferase-like 
polypeptide or a fragment thereof. 

23 . The recombinant nucleic acid construct of claim 22, wherein said lecithinxholesterol 
acyltransferase-like polypeptide is a plant lecithinxholesterol acyltransferase-like 
polypeptide. 

24. A recombinant nucleic acid construct comprising a regulatory sequence operably 
linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase-like 
polypeptide. 

25 . The recombinant nucleic acid construct of claim 24, wherein said acyl 
CoAxholesterol acyltransferase-like polypeptide is a plant acyl CoAxholesterol 
acyltransferase-like polypeptide. 

26. The recombinant construct of claim 22, wherein said regulatory sequence comprises a 
heterologous regulatory sequence. 

27. The recombinant construct of claim 24, wherein said regulatory sequence comprises a 
heterologous regulatory sequence. 

28 . The recombinant construct of claim 22, wherein said regulatory sequence is functional 
in a plant cell. 

29. The recombinant construct of claim 24, wherein said regulatory sequence is functional 
in a plant cell. 



30. 



The recombinant construct of claim 22, further comprising a termination sequence. 
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31. The recombinant construct of claim 24 further comprising a termination sequence. 

32. The recombinant construct of claim 22 wherein said polynucleotide is selected from 
the group consisting of SEQ ID NO: 2, 4, 6, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 
20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 43, 44, 45, 46, 47, 48, 49, 50, 51, 73 and 75. 

33 . The recombinant construct of claim 24, wherein said polynucleotide is selected from 
the group consisting of SEQ ID NO: 33 and 42. 

34. The recombinant construct of claim 22, wherein said regulatory sequence comprises a 
constitutive promoter. 

35. The recombinant construct of claim 24, wherein said regulatory sequence comprises a 
constitutive promoter. 

36. The recombinant construct of claim 22, wherein said regulatory sequence comprises 
an inducible promoter. 

37. The recombinant construct of claim 24, wherein said regulatory sequence comprises 
an inducible promoter. 

38. The recombinant construct of claim 22, wherein said regulatory sequence is selected 
from the group consisting of a tissue specific promoter, a developmental^/ regulated 
promoter, an organelle specific promoter, and a seed specific promoter. 

39. The recombinant construct of claim 24, wherein said regulatory sequence is selected 
from the group consisting of a tissue specific promoter, a developmental^ regulated 
promoter, an organelle specific promoter, and a seed specific promoter. 

40. A host cell containing the recombinant construct of claim 22 or 24. 
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41 . The host cell of claim 40, wherein said host cell is selected from the group consisting 
of plant cells, animal cells, insect cells, yeast, bacteria, bacteriophage and viruses. 

42. The host cell of claim 40, wherein said host cell is a plant cell. 

43 . The host cell of claim 40, wherein said host cell expresses a lecithinxholesterol 
acyltransferase-like polypeptide or an acyl CoAxholesterol acyltransferase-like 
polypeptide. 

44. The host cell of claim 43 , wherein said cholesterol acyltransferase-like polypeptide is 
a plant acyltransferase-like polypeptide. 

45. A plant comprising at least one host cell of claim 40. 

46. The progeny of a plant of claim 45 . 

47. A seed from the plant of claim 45. 

48 . A plant comprising the recombinant construct of claim 22 or 24. 

49. The progeny of a plant of claim 48. 

50. A seed from the plant of claim 48. 

51. A purified polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 3, SEQ ID NO: 3 with at least one conservative amino acid 
substitution, SEQ ID NO: 5, SEQ ED NO: 5 with at least one conservative amino acid 
substitution, SEQ ED NO: 7, SEQ ID NO: 7 with at least one conservative amino acid 

5 substitution, SEQ ID NO: 9, SEQ ED NO: 9 with at least one conservative amino acid 

substitution, SEQ ID NO: 74, SEQ ED NOl 74 with at least one conservative amino 
acid substitution, SEQ ID NO: 76 and SEQ ED NO: 76 with at least one conservative 
amino acid substitution. 
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A purified immunogenic polypeptide comprising at least 10 consecutive amino acids 
from an amino acid sequence selected from the group consisting of SEQ ID NO: 3, 
SEQ ID NO: 3 with at least one conservative amino acid substitution, SEQ ID NO: 5, 
SEQ ID NO: 5 with at least one conservative amino acid substitution, SEQ ID NO: 7, 
SEQ ID NO: 7 with at least one conservative amino acid substitution, SEQ ID NO: 9, 
SEQ ID NO: 9 with at least one conservative amino acid substitution, SEQ ED NO: 
74, SEQ ED NO: 74 with at least one conservative amino acid substitution, SEQ ED 
NO: 76 and SEQ ED NO: 76 with at least one conservative amino acid substitution. 

An antibody which specifically binds to an immunogenic polypeptide of claim 52. 

A method for producing a lecithin: cholesterol acyltransferase-like polypeptide or an 
acyl CoAxholesterol acyltransferase-like polypeptide comprising culturing a host cell 
of claim 40 under conditions permitting expression of said lecithin: cholesterol 
acyltransferase-like polypeptide or acyl CoAxholesterol acyltransferase-like 
polypeptide. 

The method of claim 54, further comprising isolating the cholesterol acyltransferase- 
like polypeptide from the host cell or from the medium in which the host cell is 
cultured. 

A method for modifying the sterol content of a host cell, comprising fransforming a 
host cell with a recombinant construct containing a regulatory sequence operably 
linked to a polynucleotide encoding a lecithin: cholesterol acyltransferase-like 
polypeptide and culturing said host cell under conditions wherein said host cell 
expresses a lecithinxholesterol acyltransferase-like polypeptide such that said host 
cell has a modified sterol composition as compared to host cells without the 
recombinant construct. 

The method of claim 56, wherein said lecithinxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 
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A method for modifying the sterol content of a host cell, comprising transforming a 
host cell with a recombinant construct containing a regulatory sequence operably 
linked to a polynucleotide encoding an acyl CoA:cholesterol acyltransferase-like 
polypeptide and culturing said host cell under conditions wherein said host cell 
expresses an acyl CoAxholesterol acyltransferase-like polypeptide such that said host 
cell has a modified sterol composition as compared to host cells without the 
recombinant construct. 

The method of claim 58, wherein said acyl CoAxholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

The method of claim 56, wherein said modified sterol composition is an increase in 
sterol esters. 

The method of claim 58, wherein said modified sterol composition is an increase in 
sterol esters. 

The method of claim 56, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2, 4, 6, 8, 10, 11,73 and 75. 

The method of claim 58 wherein said polynucleotide encoding a acyl CoAxholesterol 
acyltransferase-like polypeptide is SEQ ID NO 33 OR 42. 

The method of claim 56, wherein said regulatory sequence comprises a constitutive 
promoter. 

The method of claim 58, wherein said regulatory sequence comprises a constitutive 
promoter. 

The method of claim 56, wherein said regulatory sequence is an inducible promoter. 



The method of claim 58, wherein said regulatory sequence is an inducible promoter. 
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The method of claim 56, wherein said regulatory sequence is a tissue specific 
promoter. 

The method of claim 58, wherein said regulatory sequence is a tissue specific 
promoter. 

The method of claim 56, wherein said regulatory sequence is a seed specific promoter. 

The method of claim 58, wherein said regulatory sequence is a seed specific promoter. 

The method of claim 56, wherein said polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

The method of claim 58, wherein said polynucleotide encoding an acyl 
CoAxholesterol acyltransferase-like polypeptide is in the antisense orientation. 

The method of claim 72, wherein said modified sterol composition is a decrease in 
sterol esters. 

The method of claim 73, wherein said modified sterol composition is a decrease in 
sterol esters. 

A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in modified 
sterol composition of said plant as compared to the same plant without said 
recombinant construct. 

The plant of claim 76, wherein said lecithmxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 
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78. The plant of claim 76, wherein said polynucleotide encoding a lecithin: cholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2, 4, 6, 8, 10, 11,73 and 75. 

79. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding an acyl Co A: cholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in modified 
sterol composition of said plant as compared to the same plant without said 

5 recombinant construct. 

80. The plant of claim 79, wherein said acyl Co A: cholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

81 . The plant of claim 79, wherein said polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 

82. The plant of claim 76, wherein said regulatory sequence comprises a tissue specific 
promoter. 

83. The plant of claim 79, wherein said regulatory sequence comprises a tissue specific 
promoter. 

84. The plant of claim 76, wherein said regulatory sequence comprises a seed specific 
promoter. 

85 . The plant of claim 79, wherein said regulatory sequence comprises a seed specific 
promoter. 

86. The plant of claim 76, wherein said modified sterol composition is an increase in 
sterol esters. "* 



87. The plant of claim 79, wherein said modified sterol composition is an increase in 
sterol esters. 
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88. The plant of claim 76, wherein the polynucleotide encoding a lecithinxholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

89. The plant of claim 79, wherein the polynucleotide encoding an acyl CoAxholesterol 
acyltransferase-like polypeptide is in the antisense orientation. 

90. An oil obtained from the plant of claim 76 or 79. 

91 . A method for producing an oil with a modified sterol composition comprising, 
providing a plant of claim 76 or 79 and extracting the oil from said plant. 

92. An oil produced by the method of claim 9 1 . 

93. A method for altering oil production by a host cell comprising, transforming a host 
cell with a recombinant construct containing a regulatory sequence operably linked to 
a polynucleotide encoding a lecithinxholesterol acyltransferase-like polypeptide and 
culturing said host cell under conditions wherein said host cell expresses a 

5 lecithinxholesterol acyltransferase-like polypeptide such that said host cell has an 

altered oil production as compared to host cells without the recombinant construct. 

94. The method of claim 93, wherein said lecithinxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 

95. A method for altering oil production by a host cell comprising, transforming a host 
cell with a recombinant construct containing a regulatory sequence operably linked to 
a polynucleotide encoding an acyl CoAxholesterol acyltransferase-like polypeptide 
and culturing said host cell under conditions wherein said host cell expresses an acyl 

5 CoAxholesterol acyltransferase-like polypeptide such that said host cell has an altered 

oil production as compared to host cells without the recombinant construct. 

96. The method of claim 95, wherein said acyl CoAxholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 
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97. The method of claim 93 , wherein said oil production is increased. 

98. The method of claim 95, wherein said oil production is increased. 

99. The method of claim 93, wherein said host cell is a plant cell. 

1 00. The method of claim 95, wherein said host cell is a plant cell. 

101. The method of claim 93, wherein said polynucleotide encoding a lecithin: cholesterol 
acyltransferase-like polypeptide is selected from the group consisting of SEQ ID NO: 
2, 4, 6, 8, 10, 11, 73 and 75. 

1 02. The method of claim 95, wherein said polynucleotide encoding an acyl 
CoA.cholesterol acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 

1 03. The method of claim 93, wherein said regulatory sequence is a tissue specific 
promoter. 

104. The method of claim 95, wherein said regulatory sequence is a tissue specific 
promoter. 

1 05 . The method of claim 93 , wherein said regulatory sequence is a seed specific promoter. 

106. The method of claim 95, wherein said regulatory sequence is a seed specific promoter. 

1 07. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding a lecithinxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in an altered 
production of oil by said plant as compared to the same plant without said 

5 recombinant construct. 
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108. The plant of claim 107, wherein said lecithinxholesterol acyltransferase-like 
polypeptide is a plant lecithinxholesterol acyltransferase-like polypeptide. 

1 09. A plant comprising a recombinant construct containing a regulatory sequence 
operably linked to a polynucleotide encoding an acyl CoAxholesterol acyltransferase- 
like polypeptide wherein expression of said recombinant construct results in an altered 
production of oil by said plant as compared to the same plant without said 

5 recombinant construct. 

1 1 0. The plant of claim 1 09, wherein said acyl Co A: cholesterol acyltransferase-like 
polypeptide is a plant acyl CoAxholesterol acyltransferase-like polypeptide. 

10 111. The plant of claim 1 07, wherein said oil production is increased. 

112. The plant of claim 1 09, wherein said oil production is increased. 

113. The plant of claim 107, wherein said polynucleotide encoding a lecithinxholesterol 

1 5 acyltransferase-like polypeptide is selected from the group consisting of SEQ ED NO: 

2,4,6, 8, 10, 11, 73 and 75. 

114. The plant of claim 109, wherein said polynucleotide encoding an acyl 
CoAxholesterol acyltransferase-like polypeptide is SEQ ID NO: 33 or 42. 

20 

115. The plant of claim 107, wherein said regulatory sequence is a tissue specific promoter. 

1 16. The plant of claim 109, wherein said regulatory sequence is a tissue specific promoter. 
25 117. The plant of claim 107, wherein said regulatory sequence is a seed specific promoter. 

118. The plant of claim 1 09, wherein said regulatory sequence is a seed specific promoter. 

119. A food, food ingredient or food product comprising the oil of claim 90 or 92. 
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120. 



A food, food ingredient or food product comprising the plant of claim 107 or 109. 
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ABSTRACT 



The present invention is directed to lecithinxholesterol acyltransferase-like 
polypeptides (LCAT) and acyl CoA:cholesterol acyltransferases-like polypeptides (ACAT). 
The invention provides polynucleotides encoding such cholesterol:acyltransferases-like 
polypeptides, polypeptides encoded by such polynucleotides, and the use of such 
polynucleotides to alter sterol composition and oil production in plants and host cells. Also 
provided are oils produced by the plants and host cells containing the polynucleotides and 
food products, nutritional supplements, and pharmaceutical composition containing plants or 
oils of the present invention. The polynucleotides of the present invention include those 
derived from plant sources 
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Attorney's Docket No. MTC 6718 
DECLARATION AND POWER OF ATTORNEY 



REGULAR OR DESIGN APPLICATION 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated 
below next to my name. 

I believe I am the original, first and sole inventor (if only one 
name is listed below) or an original, first and joint inventor 
(if plural names are listed below) of the subject matter which is 
claimed and for which a patent is sought on the invention 
entitled : 

PLANT STEROL ACYLTRANSFERASES 
the specification of which: 
(check one) 

[ ] is attached hereto 

[ ] was filed on as Application Serial No. 

, and was amended on . 

[ ] was described and claimed in PCT International Application 

No. , filed on and as amended 

under PCT Article 19 on , if any. 



ACKNOWLEDGEMENT OF REVIEW OF PAPERS AND DUTY OF CANDOR 

I hereby state that I have reviewed and understand the contents 
of the above identified specification, including the claims, as 
amended by any amendment referred to above . 

I acknowledge the duty to disclose information which is material 
to patentability as defined in Title 37, Code of Federal 
Regulations §1.56. 
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PRIORITY CLAIM 



I hereby claim foreign priority benefits under Title 35, United 
States Code, §119 (a) - (d) or §365 (b) of any foreign application 
for patent or inventor's certificate, or §365 (a) of any PCT 
application which designates at least one country other than the 
United States of America, listed below and have also identified 
below any foreign application for patent or inventor's 
certificate having a filing date before that of the application 
on which priority is claimed: 

Priority Claimed 



(Number) (Country) (Day/Month/Year Filed) 



(Number) (Country) (Day/Month/Year Filed) 



(Number) (Country) (Day/Month/Year Filed) 

Priority Not Claimed 

ANY FOREIGN APPLICATION (S) , ON THE SAME SUBJECT MATTER WHICH HAS 
A FILING DATE EARLIER THAN THE EARLIEST APPLICATION FROM WHICH 
PRIORITY IS CLAIMED 



(Number) (Country) (Day/Month/Year Filed) 



CLAIM FOR BENEFIT OF PROVISIONAL APPLICATION (S) 

I hereby claim the benefit under Title 35, United States Code, 
§119 (e) of any United States provisional application (s) listed 
below. 

60/152 ,493 August 30, 1999 

(Application Number) (Filing Date) 



(Application Number) 



(Filing Date) 



CLAIM FOR BENEFIT OF EARLIER U.S. APPLICATION (S) 
UNDER 35 U.S.C. 120 

(complete this part only if this is a divisional, 
continuation or CIP application) 

I hereby claim the benefit under Title 35, United States Code, 
§120 of any United States application (s) , or §365 (c) of any PCT 
international application designating the United States of 
America, listed below and, insofar as the subject matter of each 
of the claims of this application is not disclosed in the prior 
United States application in the manner provided by the first 
paragraph of Title 35, United States Code §112, I acknowledge the 
duty to disclose information which is material to patentability 
as defined in Title 37, Code of Federal Regulations, §1.56 which 
became available between the filing date of the prior application 
and the national or PCT International filing date of this 
application: 



(Serial No.) (Filing Date) (Status) 



(Serial No.) (Filing Date) (Status) 



POWER OF ATTORNEY 

I hereby appoint the following attorneys to prosecute this 
application and to transact all business in the Patent and 
Trademark Office connected therewith: Irving Powers (15,700), 
Donald G. Leavitt (17,626), John K. Roedel, Jr. (25,914), Michael 
E. Godar (28,416), Edward J. Hejlek (31,525), William E. Lahey 

(26,757), Richard G. Heywood (18,224), Frank R. Agovino (27,416), 
Kurt F. James (33,716), G. Harley Blosser (33,650), Paul I. J. 
Fleischut (35,513), Vincent M. Keil (36,838), Robert M. Evans, 
Jr. (36,794), Robert M. Bain (36,736), Joseph A. Schaper 

(30,493), Kathleen M. Petrillo (35,076), David E. Crawford, Jr. 

(38,118), Paul A. Maddock (37,877), Richard L. Bridge (40,529), 
Christopher M. Goff (41,785), James E. Butler (40,931), Derick E. 
Allen (43,468), Matthew L. Cutler (43,574), Michael G. Munsell 

(43,820), Karen Y. Hui (44,785), Anthony R. Kinney (44,834), 
Brian P. Klein (44,837), Sarah J. Chickos (46,157), Donald W. 
Tuegel (45,424), Steven M. Ritchey (46,321), Michael J. Thomas 

(39,857), Kathryn J. Doty (40,593), and Laura R. Polcyn (P- 
47,000), all of the law firm of SENNIGER, POWERS, LEAVITT & 
ROEDEL, One Metropolitan Square, 16th Floor, St. Louis, Missouri 
63102 . 
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Send Correspondence To : 



Direct Telephone Calls To: 



Customer Number: 0 0 0321 



James E. Butler, Ph.D. 
(314) 231-5400 



I hereby declare that all statements made herein of my own 
knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may 
jeopardize the validity of the application or any patent issued 
thereon . 



Full name of sole or first inventor Michael Lassner 



Inventor's signature Date 

Residence Redwood City, CA Citizenship U. S . A 

Post Office address 515 Galveston Drive 



Full name of second joint inventor Alison Van Eenennaam 



Redwood City, California 94 063 



Second inventor's signature 



Date 



Residence 



Davis, California 



Citizenship Australia 



Post Office address 



856 Burr Street 



Davis, California 95616 
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SEQUENCE LISTING 

<110> Monsanto Company 

<120> PLANT STEROL ACYLTRANSFERASES 

<130> MTC6718 

<140> 
<141> 

<150> 60/152,493 
<151> 1999-08-30 

<160> 80 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 440 
<212> PRT 

<213> Homo sapiens 
<400> 1 

Met Gly Pro Pro Gly Ser Pro Trp Gin Trp Val Thr Leu Leu Leu Gly 



Leu Leu Leu Pro Pro Ala Ala Pro Phe Trp Leu Leu Asn Val Leu Phe 
20 25 30 

Pro Pro His Thr Thr Pro Lys Ala Glu Leu Ser Asn His Thr Arg Pro 
35 40 45 

Val lie Leu Val Pro Gly Cys Leu Gly Asn Gin Leu Glu Ala Lys Leu 



Asp Lys Pro Asp Val Val Asn Trp Met Cys Tyr Arg Lys Thr Glu Asp 



Phe Phe Thr lie Trp Leu Asp Leu Asn Met Phe Leu Pro Leu Gly Val 



Cys Trp lie Asp Asn Thr Arg Val Val Tyr Asn Arg Ser Ser Gly 
100 105 110 



Leu Val Ser Asn Ala Pro Gly 
115 



Val Gin lie Arg Val Pro Gly Phe Gly 
120 125 



Lys Thr Tyr Ser Val Glu Tyr Leu Asp Ser Ser Lys Leu Ala Gly Tyr 
130 135 140 



Leu His Thr Leu Val Gin Asn Leu Val Asn Asn Gly Tyr Val Arg Asp 
145 150 155 160 

Glu Thr Val Arg Ala Ala Pro Tyr Asp Trp Arg Leu Glu Pro Gly Gin 
165 170 175 

Gin Glu Glu Tyr Tyr Arg Lys Leu Ala Gly Leu Val Glu Glu Met His 
180 185 190 

Ala Ala Tyr Gly Lys Pro Val Phe Leu lie Gly His Ser Leu Gly Cys 
195 200 205 

Leu His Leu Leu Tyr Phe Leu Leu Arg Gin Pro Gin Ala Trp Lys Asp 
210 215 220 

Arg Phe lie Asp Gly Phe He Ser Leu Gly Ala Pro Trp Gly Gly Ser 
225 230 235 240 

He Lys Pro Met Leu Val Leu Ala Ser Gly Asp Asn Gin Gly He Pro 
245 250 255 

He Met Ser Ser He Lys Leu Lys Glu Glu Gin Arg He Thr Thr Thr 
260 265 270 

Ser Pro Trp Met Phe Pro Ser Arg Met Ala Trp Pro Glu Asp His Val 
275 280 285 

Phe He Ser Thr Pro Ser Phe Asn Tyr Thr Gly Arg Asp Phe Gin Arg 
290 295 300 

Phe Phe Ala Asp Leu His Phe Glu Glu Gly Trp Tyr Met Trp Leu Gin 
305 310 315 320 

Ser Arg Asp Leu Leu Ala Gly Leu Pro Ala Pro Gly Val Glu Val Tyr 
325 330 335 

Cys Leu Tyr Gly Val Gly Leu Pro Thr Pro Arg Thr Tyr He Tyr Asp 
340 345 350 

His Gly Phe Pro Tyr Thr Asp Pro Val Gly Val Leu Tyr Glu Asp Gly 
355 360 365 



Asp Asp Thr Val Ala Thr Arg Ser Thr Glu Leu Cys Gly Leu Trp Gin 
370 375 380 



Gly Arg Gin Pro Gin Pro Val His Leu Leu Pro Leu His Gly lie Gin 
385 390 395 400 



His Leu Asn Met Val Phe Ser Asn Leu Thr Leu Glu His lie Asn Ala 
405 410 415 



lie Leu Leu Gly Ala Tyr Arg Gin Gly Pro Pro Ala Ser Pro Thr Ala 
420 425 430 



Ser Pro Glu Pro Pro Pro Pro Glu 
435 440 



<210> 2 
<211> 1299 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 2 

atgaaaaaaa tatcttcaca 
acctcgatgt gtcaagctgt 
ggaggtaacc agctagaggt 
agcagctggt tatatccgat 
gcagcagtgt tattgtctcc 
gaccctgatt tggatgatta 
ggttcgacca aatcacttct 
gaacatttgg tgaaagctct 
ggagctccat atgatttcag 
tcacagttcc tacaagacct 
aagccagtga tactcctctc 
cgtaccaccc cttcatggcg 
tggggtggga cgatctctca 
ttagttaacc ctttgctggt 
cttccatcta ccaaagtgtt 
aactacacag cttacgagat 
gtgccttaca agacaagagt 
gtcacttgca tatatgggag 
ggattcgata agcaaccaga 
agcttagcag ctttgaaagt 
acatctatac ttaaagacga 
aattatgaat tagccaatgt 



ttattcggta gtcatagcga 
gggtagcaac gtgtaccctt 
acggctggac agagaataca 
tcataagaag agtggtggat 
cttcaccagg tgcttcagcg 
ccaaaatgct cctggtgtcc 
atacctcgac cctcgtctcc 
agagaaaaaa tgcgggtatg 
gtacggcctg gctgcttcgg 
caaacaattg gtggaaaaaa 
ccatagccta ggaggacttt 
ccgcaagtac atcaaacact 
gatgaagaca tttgcttctg 
cagacggcat cagaggacct 
tcacgacaga actaaaccgc 
ggatcggttt tttgcagaca 
gttgccttta acagaggagc 
aggagttgat acaccggagg 
gattaagtat ggagatggag 
cgatagcttg aacaccgtag 
gatcgcactt aaagagatta 
taatgccgtc aatgaatga 



tactcgttgt ggtgacgatg 60 
tgattctggt tccaggaaac 12 0 
agccaagtag tgtctggtgt 180 
ggtttaggct atggttcgat 24 0 
atcgaatgat gttgtactat 300 
aaacccgggt tcctcatttc 360 
gagatgccac atcttacatg 42 0 
ttaacgacca aaccatccta 480 
gccacccgtc ccgtgtagcc 540 
ctagcagcga gaacgaagga 600 
tcgtcctcca tttcctcaac 660 
ttgttgcact cgctgcgcca 72 0 
gcaacacact cggtgtccct 780 
ccgagagtaa ccaatggcta 84 0 
ttgtcgtaac tccccaggtt 900 
ttggattctc acaaggagtt 960 
tgatgactcc gggagtgcca 1020 
ttttgatgta tggaaaagga 1080 
atgggacggt taatttggcg 114 0 
agattgatgg agtttcgcat 1200 
tgaagcagat ttcaattatt 12 60 
1299 



<210> 3 
<211> 432 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 3 

Met Lys Lys He Ser Ser His Tyr Ser Val Val He Ala He Leu Val 
15 10 15 



Val Val Thr Met Thr Ser Met Cys Gin Ala Val Gly Ser Asn Val Tyr 



Pro Leu He Leu Val Pro Gly Asn Gly Gly Asn Gin Leu Glu Val Arg 



Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val Trp Cys Ser Ser Trp Leu 



Tyr Pro He His Lys Lys Ser Gly Gly Trp Phe Arg Leu Trp Phe Asp 
65 70 75 80 

Ala Ala Val Leu Leu Ser Pro Phe Thr Arg Cys Phe Ser Asp Arg Met 
85 90 95 

Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp Tyr Gin Asn Ala Pro Gly 
100 105 110 

Val Gin Thr Arg Val Pro His Phe Gly Ser Thr Lys Ser Leu Leu Tyr 
115 120 125 

Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser Tyr Met Glu His Leu Val 
130 135 140 

Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val Asn Asp Gin Thr He Leu 
145 150 155 160 

Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu Ala Ala Ser Gly His Pro 
165 170 175 

Ser Arg Val Ala Ser Gin Phe Leu Gin Asp Leu Lys Gin Leu Val Glu 
180 185 190 

Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro Val He Leu Leu Ser His 
195 200 205 

Ser Leu Gly Gly Leu Phe Val Leu His Phe Leu Asn Arg Thr Thr Pro 
210 215 220 

Ser Trp Arg Arg Lys Tyr He Lys His Phe Val Ala Leu Ala Ala Pro 
225 230 235 240 



Trp Gly Gly Thr He Ser Gin Met Lys Thr Phe Ala Ser Gly Asn Thr 
245 250 255 



Leu Gly Val Pro Leu Val Asn Pro Leu Leu Val Arg Arg His Gin Arg 
260 265 270 



Thr Ser Glu Ser 
275 

Asp Arg Thr Lys 
290 

Tyr Glu Met Asp 
305 

Val Pro Tyr Lys 



Pro Gly Val Pro 
340 

Glu Val Leu Met 
355 

Lys Tyr Gly Asp 
370 

Leu Lys Val Asp 
385 

Thr Ser lie Leu 



He Ser He He 
420 



Asn Gin Trp Leu 
280 

Pro Leu Val Val 
295 

Arg Phe Phe Ala 
310 

Thr Arg Val Leu 
325 

Val Thr Cys He 



Tyr Gly Lys Gly 
360 

Gly Asp Gly Thr 
375 

Ser Leu Asn Thr 
390 

Lys Asp Glu He 
405 

Asn Tyr Glu Leu 



Leu Pro Ser Thr 



Thr Pro Gin Val 
300 

Asp He Gly Phe 
315 

Pro Leu Thr Glu 
330 

Tyr Gly Arg Gly 
345 

Gly Phe Asp Lys 



Val Asn Leu Ala 
380 

Val Glu He Asp 
395 

Ala Leu Lys Glu 
410 

Ala Asn Val Asn 
425 



Lys Val Phe His 
285 

Asn Tyr Thr Ala 



Ser Gin Gly Val 
320 

Glu Leu Met Thr 
335 

Val Asp Thr Pro 
350 

Gin Pro Glu He 
365 

Ser Leu Ala Ala 



Gly Val Ser His 
400 

He Met Lys Gin 
415 

Ala Val Asn Glu 
430 



<210> 4 
<211> 1641 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 4 

atgggagcga attcgaaatc agtaacggct 
atttgcggtg gccgaactgc ggtggaggat 
ctatcgggta taatcattcc gggatttgcg 
gactgtccat acactccgtt ggacttcaat 



tccttcaccg tcatcgccgt ttttttcttg 60 
gagaccgagt ttcacggcga ctactcgaag 12 0 
tcgacgcagc tacgagcgtg gtcgatcctt 180 
ccgctcgacc tcgtatggct agacaccact 24 0 



aagcttcttt ctgctgtcaa ctgctggttt aagtgtatgg tgctagatcc ttataatcaa 300 
acagaccatc ccgagtgtaa gtcacggcct gacagtggtc tttcagccat cacagaattg 360 
gatccaggtt acataacagg tcctctttct actgtctgga aagagtggct taagtggtgt 42 0 
gttgagtttg gtatagaagc aaatgcaatt gtcgctgttc catacgattg gagattgtca 480 
ccaaccaaat tggaagagcg tgacctttac tttcacaagc tcaagttgac ctttgaaact 54 0 
gctttaaaac tccgtggcgg cccttctata gtatttgccc attcaatggg taataatgtc 600 
ttcagatact ttctggaatg gctgaggcta gaaattgcac caaaacatta tttgaagtgg 660 
cttgatcagc atatccatgc ttatttcgct gttggagctc ctcttcttgg ttctgttgag 72 0 
gcaatcaaat ctactctctc tggtgtaacg tttggccttc ctgtttctga gggaactgct 780 
cggttgttgt ccaattcttt tgcgtcgtca ttgtggctta tgccattttc aaagaattgc 840 
aagggtgata acacatcctg gacgcatttt tctgggggtg ctgcaaagaa agataagcgc 900 
gtataccact gtgatgaaga ggaatatcaa tcaaaatatt ctggctggcc gacaaatatt 960 
attaacattg aaattccttc cactagcgtt acagaaacag ctctagtcaa catgaccagc 102 0 
atggaatgtg gccttcccac ccttttgtct ttcacagccc gtgaactagc agatgggact 1080 
cttttcaaag caatagaaga ctatgaccca gatagcaaga ggatgttaca ccagttaaag 1140 
aagttgtatc atgatgaccc tgtttttaat cctctgactc cttgggagag accacctata 12 0 0 
aaaaatgtat tttgcatata tggtgctcat ctaaagacag aggttggtta ttactttgcc 12 60 
ccaagtggca aaccttatcc tgataattgg atcatcacgg atatcattta cgaaactgaa 132 0 
ggttccctcg tgtcaaggtc tggaactgtg gttgatggga acgctggacc tataactggg 13 80 
gatgagacgg taccctatca ttcactctct tggtgcaaga attggctcgg acctaaagtt 1440 
aacataacaa tggctcccca gccagaacac gatggaagcg acgtacatgt ggaactaaat 1500 
gttgatcatg agcatgggtc agacatcata gctaacatga caaaagcacc aagggttaag 1560 
tacataacct tttatgaaga ctctgagagc attccgggga agagaaccgc agtctgggag 162 0 
cttgataaaa gtgggtatta a 1641 



<210> 5 
<211> 546 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 5 

Met Gly Ala Asn Ser Lys Ser Val Thr Ala Ser Phe Thr Val lie Ala 
15 10 15 

Val Phe Phe Leu He Cys Gly Gly Arg Thr Ala Val Glu Asp Glu Thr 
20 25 30 

Glu Phe His Gly Asp Tyr Ser Lys Leu Ser Gly He He He Pro Gly 
35 40 45 

Phe Ala Ser Thr Gin Leu Arg Ala Trp Ser He Leu Asp Cys Pro Tyr 
50 55 60 

Thr Pro Leu Asp Phe Asn Pro Leu Asp Leu Val Trp Leu Asp Thr Thr 
65 70 75 80 



Lys Leu Leu Ser Ala Val Asn Cys Trp Phe Lys Cys Met Val Leu Asp 



Pro Tyr Asn Gin Thr Asp His Pro Glu Cys Lys Ser Arg Pro Asp Ser 
100 105 110 

Gly Leu Ser Ala lie Thr Glu Leu Asp Pro Gly Tyr lie Thr Gly Pro 
115 120 125 

Leu Ser Thr Val Trp Lys Glu Trp Leu Lys Trp Cys Val Glu Phe Gly 
130 135 140 

lie Glu Ala Asn Ala lie Val Ala Val Pro Tyr Asp Trp Arg Leu Ser 
145 150 155 160 

Pro Thr Lys Leu Glu Glu Arg Asp Leu Tyr Phe His Lys Leu Lys Leu 
165 170 175 

Thr Phe Glu Thr Ala Leu Lys Leu Arg Gly Gly Pro Ser lie Val Phe 
180 185 190 

Ala His Ser Met Gly Asn Asn Val Phe Arg Tyr Phe Leu Glu Trp Leu 
195 200 205 

Arg Leu Glu lie Ala Pro Lys His Tyr Leu Lys Trp Leu Asp Gin His 
210 215 220 

lie His Ala Tyr Phe Ala Val Gly Ala Pro Leu Leu Gly Ser Val Glu 
225 230 235 240 

Ala lie Lys Ser Thr Leu Ser Gly Val Thr Phe Gly Leu Pro Val Ser 
245 250 255 

Glu Gly Thr Ala Arg Leu Leu Ser Asn Ser Phe Ala Ser Ser Leu Trp 
260 265 270 

Leu Met Pro Phe Ser Lys Asn Cys Lys Gly Asp Asn Thr Ser Trp Thr 
275 280 285 

His Phe Ser Gly Gly Ala Ala Lys Lys Asp Lys Arg Val Tyr His Cys 
290 295 300 

Asp Glu Glu Glu Tyr Gin Ser Lys Tyr Ser Gly Trp Pro Thr Asn lie 
305 310 315 320 

lie Asn lie Glu lie Pro Ser Thr Ser Val Thr Glu Thr Ala Leu Val 
325 330 335 



Asn Met Thr Ser Met Glu Cys Gly Leu Pro Thr Leu Leu Ser Phe Thr 



340 



345 



350 



Ala Arg Glu Leu Ala Asp Gly Thr Leu Phe Lys Ala lie Glu Asp Tyr 
355 360 365 

Asp Pro Asp Ser Lys Arg Met Leu His Gin Leu Lys Lys Leu Tyr His 
370 375 380 

Asp Asp Pro Val Phe Asn Pro Leu Thr Pro Trp Glu Arg Pro Pro He 
385 390 395 400 

Lys Asn Val Phe Cys He Tyr Gly Ala His Leu Lys Thr Glu Val Gly 
405 410 415 

Tyr Tyr Phe Ala Pro Ser Gly Lys Pro Tyr Pro Asp Asn Trp He He 
420 425 430 

Thr Asp He He Tyr Glu Thr Glu Gly Ser Leu Val Ser Arg Ser Gly 
435 440 445 

Thr Val Val Asp Gly Asn Ala Gly Pro He Thr Gly Asp Glu Thr Val 
450 455 460 

Pro Tyr His Ser Leu Ser Trp Cys Lys Asn Trp Leu Gly Pro Lys Val 
465 470 475 480 

Asn He Thr Met Ala Pro Gin Pro Glu His Asp Gly Ser Asp Val His 
485 490 495 

Val Glu Leu Asn Val Asp His Glu His Gly Ser Asp He He Ala Asn 
500 505 510 

Met Thr Lys Ala Pro Arg Val Lys Tyr He Thr Phe Tyr Glu Asp Ser 
515 520 525 

Glu Ser He Pro Gly Lys Arg Thr Ala Val Trp Glu Leu Asp Lys Ser 
530 535 540 

Gly Tyr 
545 



<210> 6 
<211> 1608 
<212> DNA 

<213> Arabidopsis thaliana 



:400> 6 



atgtctctat tactggaaga gatcattaga tcagtagagg ctttgctgaa gctcagaaat 60 
cggaatcaag aaccctatgt tgacccgaat ctaaacccgg ttcttctagt tccaggaatc 120 
gctggatcaa ttctaaacgc cgttgatcat gagaacggga acgaagaacg tgtttgggtt 180 
aggatctttg gtgctgatca tgagtttcga acaaagatgt ggtctcgatt tgatccttca 240 
actggtaaaa cgatatctct tgatccaaaa acgagtattg ttgttcctca agacagagct 3 00 
gggctacatg caattgatgt cttagaccct gatatgattg ttggccgtga gtctgtgtac 360 
tatttccatg agatgattgt tgagatgatc ggatggggat ttgaagaagg gaaaaccctt 42 0 
tttggttttg gttatgattt ccgccaaagc aacagactgc aggaaacgtt ggaccagttt 4 80 
gctaaaaagt tggaaactgt ttataaagcc tcaggagaga agaagattaa tgttattagt 540 
cattctatgg gaggactatt ggtgaaatgt ttcatgggtc tgcatagtga tatattcgag 600 
aagtatgtac agaattggat tgctattgct gctccatttc gaggtgctcc tggatatatc 660 
acatcgactt tattgaatgg aatgtcgttt gtcaatggtt gggaacagaa ctttttcgtc 720 
tctaagtgga gcatgcatca gctgcttatt gagtgtccat ccatatatga gctgatgtgt 780 
tgtccgtatt ttaaatggga gctccctccc gtcttagagc tgtggagaga gaaagagagc 840 
aatgatggag ttggaacctc tgatgttgtt cttgagtctt accgtagcct ggagagcctt 900 
gaagttttta cgaaatctct ttcgaataat acagctgatt attgtggaga gtcgatcgat 960 
cttcctttta actggaagat catggagtgg gctcacaaaa ccaagcaagt attagcctct 102 0 
gccaagctgc ctccgaaagt taaattctat aacatatatg ggaccaatct agaaacccct 1080 
catagtgttt gctatgggaa tgagaagatg cccgttaaag atctaacgaa tctaagatac 1140 
ttccagccga catatatatg cgtggatggt gatggcacag tcccgatgga atctgccatg 1200 
gcggatgggc ttgaagcagt agcaagagtt ggagtccctg gtgagcaccg aggaatcctc 12 60 
aacgatcacc gtgtcttccg aatgctcaaa aaatggctaa atgtaggcga accagacccg 132 0 
ttctacaacc cagtaaacga ttatgtcatc cttcccacca catatgaatt tgagaaattc 13 8 0 
catgagaatg gactcgaggt tgcttccgtg aaagaatcgt gggacatcat atcagatgac 144 0 
aacaatatcg gcacaaccgg gtcaaccgtg aactccatat cagtctctca acctggagat 1500 
gatcaaaacc ctcaagctga agctcgtgca accttaaccg tccaaccaca aagcgatggt 1560 
agacaacatg tagagctcaa tgctgtaagt gtctctgttg atgcataa 1608 



<210> 7 
<211> 535 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 7 

Met Ser Leu Leu Leu Glu Glu lie lie Arg Ser Val Glu Ala Leu Leu 
15 10 15 

Lys Leu Arg Asn Arg Asn Gin Glu Pro Tyr Val Asp Pro Asn Leu Asn 
20 25 30 

Pro Val Leu Leu Val Pro Gly lie Ala Gly Ser lie Leu Asn Ala Val 
35 40 45 

Asp His Glu Asn Gly Asn Glu Glu Arg Val Trp Val Arg lie Phe Gly 
50 55 60 



Ala Asp His Glu Phe Arg Thr Lys Met Trp Ser Arg Phe Asp Pro Ser 



65 70 75 80 

Thr Gly Lys Thr lie Ser Leu Asp Pro Lys Thr Ser He Val Val Pro 
85 90 95 

Gin Asp Arg Ala Gly Leu His Ala He Asp Val Leu Asp Pro Asp Met 
100 105 110 

He Val Gly Arg Glu Ser Val Tyr Tyr Phe His Glu Met He Val Glu 
115 120 125 

Met He Gly Trp Gly Phe Glu Glu Gly Lys Thr Leu Phe Gly Phe Gly 
130 135 140 

Tyr Asp Phe Arg Gin Ser Asn Arg Leu Gin Glu Thr Leu Asp Gin Phe 
145 150 155 160 

Ala Lys Lys Leu Glu Thr Val Tyr Lys Ala Ser Gly Glu Lys Lys He 
165 170 175 

Asn Val He Ser His Ser Met Gly Gly Leu Leu Val Lys Cys Phe Met 
180 185 190 

Gly Leu His Ser Asp He Phe Glu Lys Tyr Val Gin Asn Trp He Ala 
195 200 205 

He Ala Ala Pro Phe Arg Gly Ala Pro Gly Tyr He Thr Ser Thr Leu 
210 215 220 

Leu Asn Gly Met Ser Phe Val Asn Gly Trp Glu Gin Asn Phe Phe Val 
225 230 235 240 

Ser Lys Trp Ser Met His Gin Leu Leu He Glu Cys Pro Ser He Tyr 
245 250 255 

Glu Leu Met Cys Cys Pro Tyr Phe Lys Trp Glu Leu Pro Pro Val Leu 
260 265 270 

Glu Leu Trp Arg Glu Lys Glu Ser Asn Asp Gly Val Gly Thr Ser Asp 
275 280 285 

Val Val Leu Glu Ser Tyr Arg Ser Leu Glu Ser Leu Glu Val Phe Thr 
290 295 300 

Lys Ser Leu Ser Asn Asn Thr Ala Asp Tyr Cys Gly Glu Ser He Asp 
305 310 315 320 



Leu Pro Phe Asn Trp Lys He Met Glu Trp Ala His Lys Thr Lys Gin 



325 



330 



335 



Val Leu Ala Ser Ala Lys Leu Pro Pro Lys Val Lys Phe Tyr Asn lie 
340 345 350 

Tyr Gly Thr Asn Leu Glu Thr Pro His Ser Val Cys Tyr Gly Asn. Glu 
355 360 365 

Lys Met Pro Val Lys Asp Leu Thr Asn Leu Arg Tyr Phe Gin Pro Thr 
370 375 380 

Tyr lie Cys Val Asp Gly Asp Gly Thr Val Pro Met Glu Ser Ala Met 
385 390 395 400 

Ala Asp Gly Leu Glu Ala Val Ala Arg Val Gly Val Pro Gly Glu His 
405 410 415 

Arg Gly lie Leu Asn Asp His Arg Val Phe Arg Met Leu Lys Lys Trp 
420 425 430 

Leu Asn Val Gly Glu Pro Asp Pro Phe Tyr Asn Pro Val Asn Asp Tyr 
435 440 445 

Val lie Leu Pro Thr Thr Tyr Glu Phe Glu Lys Phe His Glu Asn Gly 
450 455 460 

Leu Glu Val Ala Ser Val Lys Glu Ser Trp Asp lie lie Ser Asp Asp 
465 470 475 480 

Asn Asn lie Gly Thr Thr Gly Ser Thr Val Asn Ser lie Ser Val Ser 
485 490 495 

Gin Pro Gly Asp Asp Gin Asn Pro Gin Ala Glu Ala Arg Ala Thr Leu 
500 505 510 

Thr Val Gin Pro Gin Ser Asp Gly Arg Gin His Val Glu Leu Asn Ala 
515 520 525 

Val Ser Val Ser Val Asp Ala 
530 535 



<210> 8 

<211> 1344 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 8 



atgggctgga ttccgtgtcc gtgctgggga accaacgacg atgaaaacgc cggcgaggtg 60 
gcggatcgtg atccggtgct tctagtatct ggaattggag gctctattct gcattctaag 120 
aagaagaatt caaagtctga aattcgggtt tgggtccgaa tatttctagc taaccttgcc 180 
tttaagcaga gcctctggtc tctctataat cccaaaactg gttatacaga gccgttggat 240 
gataatattg aagtattggt ccctgatgat gaccatggac tctatgcaat tgacattcta 300 
gatccctctt ggtttgtaaa gctttgtcac ttgacggagg tttatcactt tcacgatatg 360 
atagaaatgc tggttggatg cggttataag aaggggacta cattattcgg ttatggttac 420 
gatttccgtc aaagcaatag gatcgatcta cttatactag gtctgaagaa gaagctggaa 4 80 
actgcatata aacgttcagg ggggagaaaa gtcactatca tctcccattc aatgggagga 540 
cttatggttt catgtttcat gtatctccat ccggaggcat tttccaagta tgtaaataaa 600 
tggattacaa ttgcaacacc tttccaagga gcaccagggt gcatcaatga ttcaatcttg 660 
actggagtgc aatttgtgga agggttagaa agtttctttt ttgtgtcacg ttggacgatg 720 
caccaactgt tggtcgaatg cccatctata tatgagatga tggcaaatcc agactttaag 780 
tggaaaaagc aaccagagat tcgagtttgg cgtaagaaat ctgaaaacga cgttgatact 840 
tctgtagaac tggaatcatt tggcttaatc gagagtattg atctattcaa cgatgcatta 900 
aaaaataacg agctaagcta tggtgggaat aaaatagctt tgccctttaa ctttgctatc 960 
ctcgactggg ctgctaagac aagagaaatt ctcaacaaag cgcaacttcc tgatggagtg 102 0 
tccttctata acatatatgg agtgtcactt aatacaccct ttgatgtttg ttatggcaca 1080 
gagacttctc cgatagacga tttgtctgaa atatgtcaaa ctatgcctga gtatacatat 1140 
gtagatggag atggaactgt ccctgctgaa tcagctgcag ctgctcagtt taaagcagtt 12 00 
gctagcgtag gagtttcggg tagccaccgc gggcttctcc gtgatgaaag agtgtttgag 1260 
ctcattcaac aatggttagg agttgagccc aagaaggcta aacggaagca tttaaggact 132 0 
cacaaagtag ttgattctgg ttaa 1344 



<210> 9 
<211> 447 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 9 

Met Gly Trp lie Pro Cys Pro Cys Trp Gly Thr Asn Asp Asp Glu Asn 
15 10 15 

Ala Gly Glu Val Ala Asp Arg Asp Pro Val Leu Leu Val Ser Gly lie 
20 25 30 

Gly Gly Ser lie Leu His Ser Lys Lys Lys Asn Ser Lys Ser Glu lie 
35 40 45 

Arg Val Trp Val Arg lie Phe Leu Ala Asn Leu Ala Phe Lys Gin Ser 
50 55 60 

Leu Trp Ser Leu Tyr Asn Pro Lys Thr Gly Tyr Thr Glu Pro Leu Asp 
65 70 75 80 



Asp Asn He Glu Val Leu Val Pro Asp Asp Asp His Gly Leu Tyr Ala 
85 90 95 



lie Asp lie Leu Asp Pro Ser Trp Phe Val Lys Leu Cys His Leu Thr 
100 105 110 



Glu Val Tyr His Phe His Asp Met He Glu Met Leu Val Gly Cys Gly 
115 120 125 

Tyr Lys Lys Gly Thr Thr Leu Phe Gly Tyr Gly Tyr Asp Phe Arg Gin 
130 135 140 

Ser Asn Arg He Asp Leu Leu He Leu Gly Leu Lys Lys Lys Leu Glu 
145 150 155 160 

Thr Ala Tyr Lys Arg Ser Gly Gly Arg Lys Val Thr He He Ser His 
165 170 175 

Ser Met Gly Gly Leu Met Val Ser Cys Phe Met Tyr Leu His Pro Glu 
180 185 190 

Ala Phe Ser Lys Tyr Val Asn Lys Trp He Thr He Ala Thr Pro Phe 
195 200 205 

Gin Gly Ala Pro Gly Cys He Asn Asp Ser He Leu Thr Gly Val Gin 
210 215 220 

Phe Val Glu Gly Leu Glu Ser Phe Phe Phe Val Ser Arg Trp Thr Met 
225 230 235 240 

His Gin Leu Leu Val Glu Cys Pro Ser He Tyr Glu Met Met Ala Asn 
245 250 255 

Pro Asp Phe Lys Trp Lys Lys Gin Pro Glu He Arg Val Trp Arg Lys 
260 265 270 

Lys Ser Glu Asn Asp Val Asp Thr Ser Val Glu Leu Glu Ser Phe Gly 
275 280 285 

Leu He Glu Ser He Asp Leu Phe Asn Asp Ala Leu Lys Asn Asn Glu 
290 295 300 

Leu Ser Tyr Gly Gly Asn Lys He Ala Leu Pro Phe Asn Phe Ala He 
305 310 315 320 

Leu Asp Trp Ala Ala Lys Thr Arg Glu He Leu Asn Lys Ala Gin Leu 
325 330 335 



Pro Asp Gly Val Ser Phe Tyr Asn He Tyr Gly Val Ser Leu Asn Thr 
340 345 350 



Pro Phe Asp Val 
355 

Ser Glu lie Cys 
370 

Gly Thr Val Pro 
385 

Ala Ser Val Gly 



Arg Val Phe Glu 
420 

Ala Lys Arg Lys 
435 



Cys Tyr Gly Thr 
360 

Gin Thr Met Pro 
375 

Ala Glu Ser Ala 
390 

Val Ser Gly Ser 
405 

Leu lie Gin Gin 



His Leu Arg Thr 
440 



Glu Thr Ser Pro 



Glu Tyr Thr Tyr 
380 

Ala Ala Ala Gin 
395 

His Arg Gly Leu 
410 

Trp Leu Gly Val 
425 

His Lys Val Val 



lie Asp Asp Leu 
365 

Val Asp Gly Asp 



Phe Lys Ala Val 
400 

Leu Arg Asp Glu 
415 

Glu Pro Lys Lys 
430 

Asp Ser Gly 
445 



<210> 10 
<211> 3107 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 10 

cctttttgat ctttcagctc aatgagcttt 
aatttcaaag tttccacatc gagtttattc 
tgttatccag ctttgaactc ctcccgaccc 
tgtgggttgc atctttgtta cgatctgcat 
tgctatggta gagatgggca atgttattgt 
cttaactaat caattatctc tttgattcag 
tgacaatgaa actgggttgg atccagctgg 
ggctgctgac tactttgctc ctggctactt 
acatattgga tatgaagaga aaaatatgta 
tcagaacaca gaggttcttt tctcatcgtt 
ctttcttcat tacttaaggc ttaaatatgt 
actcttagcc gtatgaaaag taatatagag 
gcagttatag ttccgcattc catgggggtc 
gaggcaccag ctcctctggg tggcgggggt 
gcggtgatga acattggtgg accatttctt 
tctgctgaag caaaggatgt tgcagttgcc 
gatcagaacc ttggctctgg aactcaaagt 
tgctatatta tcgctgcaac tgacattggt 
tctcttgaga ttagacaaat gatgaattga 
gtttcttagc ttcgacgata acatttgtca 
tatggaaagc gctggagctt ccatgcttgc 
tttcaatttt cttgtatatg catctatggt 



tctcaatttt ttgggggaac tgaatatgtg 60 
acacgtcttg aatttcgtcc atcctcgttc 12 0 
tgctatggat atattaaaaa aaaagtgttt 180 
cttcttcttt cggctcagtg ttcatgtttt 240 
tgatggtaac agtggtatag ttgatagtat 300 
gcctctatgt tgggtggaac acatgtcact 360 
tattagagtt cgagctgtat caggactcgt 42 0 
tgtctgggca gtgctgattg ctaaccttgc 480 
catggctgca tatgactggc ggctttcgtt 54 0 
ctttctatta ttctgttcca tgttacgttt 600 
ttcatgttga attaataggt acgtgatcag 660 
ttgatggttt ctaccaatgg tggaaaaaaa 72 0 
ttgtattttc tacattttat gaagtgggtt 78 0 
gggccagatt ggtgtgcaaa gtatattaag 84 0 
ggtgttccaa aagctgttgc agggcttttc 900 
aggtattgaa tatctgctta tacttttgat 960 
tattctacta aatatcaatt ctaataacat 1020 
tgattatttt gctgcttatg taactgaaac 108 0 
taattcttac gcattgctct gtgatgacca 114 0 
tactgtcttt tggagggcat tgaattttgc 12 0 0 
attctttacc aattagcatt attctgcttc 1260 
cttttatttc ttcttaatta aagactcgtt 1320 



ggagtagttg ctctattagt cgcttggttc 
ttgcagagcg attgccccag gattcttaga 
gcatgtaatg agaatgacac gcacatggga 
tgacacaata tggggcgggc ttgattggtc 
aaagcaaaag aacaacgaaa cttgtggtga 
tcctgttaac tatggaagga tgatatcttt 
tgagattaat aatattgatt ttcgagtaag 
tttgtgattg tatgatgaat atctgtacat 
agagtatccc aaatcacacc tgtcgtgacg 
ctgggatcaa agctatcgct gagtataagg 
tacattatgt tgctcctaag atgatggcgc 
ctgatgattt ggatgacacc aagtatcaag 
caaagtaagt gatttcttga ttccaactgt 
ttgttttcgg tcttgttgga tatggttttc 
ctttctcaaa aaggcttgct cagttatatt 
gcttataaat cctccgtttg gtttgttctg 
ggaaatctac tcattatacg gagtggggat 
taaccagtct cccgacagtt gcatcccctt 
cgaagatagc tgtctgaaag caggagttta 
cctaagtgcc gggtacatgt gtgcaaaagc 
cggaatcaag acttacataa gagaatacaa 
gcgcgggacg cagagtggtg cccatgttga 
tatcatgagg gttgccgccg gaggtaacgg 
tggcatattt gaatggtcgg agcgtattga 
agctgtcctg tcagcttatg tgaatccaat 
catcatcgtc atcatcatga tgctcaactc 
cgaaattctc aatacctctt taatattctt 
tgtttgaacg ataacgcaaa acttgctgcg 
atttgtgggt tatacgtagt gtagaggatg 
agttaattct gaaaatgcaa caccacatga 



cttaatatag aactttactt tcttcgaaaa 13 80 
caccgatata tttagacttc agaccttgca 1440 
ctcaacaatg tctatgttac cgaagggagg 1500 
accggagaaa ggccacacct gttgtgggaa 1560 
agcaggtgaa aacggagttt ccaagaaaag 162 0 
tgggaaagaa gtagcagagg ctgcgccatc 1680 
gacatataaa tcataataaa ccttgtacat 1740 
tttatctggt gaagggtgct gtcaaaggtc 18 00 
tgtggacaga gtaccatgac atgggaattg 1860 
tctacactgc tggtgaagct atagatctac 192 0 
gtggtgccgc tcatttctct tatgggattg 1980 
atcccaaata ctggtcaaat ccgttagaga 2040 
atccttcgtc ctgatgcatt atcagtcttt 2100 
agctcaaagc ttacaaagct gtttctgagc 2160 
gaggtgctaa agttgataca tgtgactctt 2220 
ctttttcaga ttaccgaatg ctcctgagat 2280 
accaacggaa cgagcatacg tatacaagct 2340 
tcagatattc acttctgctc acgaggagga 2400 
caatgtggat ggggatgaaa cagtacctgt 2460 
gtggcgtggc aagacaagat tcaacccttc 252 0 
tcactctccg ccggctaacc tgttggaagg 2580 
tatcatggga aactttgctt tgatcgaaga 264 0 
gtctgatata ggacatgacc aggtccactc 2700 
cctgaagctg tgaatatcat gatctcttta 2760 
actttgaaag agagatcatc atcaattcat 2 82 0 
acaaagaagc ctgagaatga tactttggtg 2880 
attgaatgta aattatacaa tcctatctaa 2940 
ccatgtttgt ttgtcttgtc aaaagcatca 3000 
attcaaattt gtgataaatt tggtaatcaa 3060 
actatgtcac taaggcc 3107 



<210> 11 

<211> 1680 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> unsure 

<222> (694) 

<223> n=unkown 

<400> 11 

cgcataaggt gttcgagtgt ttgcagcttg 
aaagatctga accataaaaa tgaccaatca 
aaaaatgtta tagttctcat gaatactaac 
catggaccta acctggcgta acggtatgtc 
ggctttaggg acacctaaaa aggcgggtcc 



agaagttccg agtccaagag acctggagcc 60 
aaatccatta agccaattca aatattcact 120 
ataacaagtg aaagtaaatt taaaaatgtt 180 
tttgccttca gcagaaagta aattactgac 24 0 
aatgttgacg acggatttga tgtgtttggc 3 00 



acaccaacct ggaccacccc caccgcctcc atcaggaaga ggtgtttcta cccatttaag 360 
gaagtgaagg aaatagatag cccccattga atgcggaacc accacaactt tcttaaaccc 42 0 
attggtggca tacattagct cgattttgct cttcagtcta cttaacgatt ggtcacgtac 480 
ctgctcggtt tcaatccaaa aactatagat tagtccaaag ctctacaaca atatgtaatt 540 
acatacacta aagtagctaa tcatggaggt cttatagtat atcattatca tcattctcta 60 0 
gaccaccagt gttgtcaatg tgatcatata ggtattaata acgactaatc tgagcatacc 660 
tcggtgttat ggaaagagag tctccaatca taangaggcc atgtgaaggt tcttgccttc 72 0 
atatccaatt tttgccaaat tctctatgag aactgcccaa gcaaagtagc atggtgcgaa 780 
atagtctgca gccactagtc ctgggactgc tcggacacgg attcccggtg gatcgagacc 84 0 
ggtctcactg tctagagata agtgctccaa ccagcacaat ggcctataaa tcaaattaca 90 0 
acaattaaac gaccaagtat acacttcaaa ctaattcaga attgagaaaa tcgaaatgct 960 
aaccagaaaa tcatgtaaat caaaaaccgt aacaatcaat atatatatat atattttcca 1020 
gaatccatgt taaaaccata accaaaaata tatgaaaatt tagaaatact aaaataatat 1080 
gttaaaactg atattctaaa tttagtaagt tttaaaatgc aatgaaatcg tcattcatgt 1140 
tttgaacata aatatatttt atagttttgt aggacgattt tctacttcct atatagaaat 1200 
caaaacttac ggtttccatt tccaaattcg aatgacattt aaaaacatat cccaaaaatc 1260 
acgattaatt attaatttcc taaaaccatc catcattact tagaaaataa tattttcata 1320 
aactagttgc aaaacaataa caaaacccaa agaaccatct ccacccatta accaaaatga 13 80 
aaatccaaag accatccata acaacaacag tataacacta cgtaaagcca attcaagaag 1440 
aaaccaagct taaccaatta tacatacccc taaccagacg aaccaaacca atatctgacc 1500 
gggtctataa aaatatctcg aaccgaacat aacggtctaa tgtgttacct tctaagaatc 1560 
tcggagaagc tagcacccca aagacgttta cgaaagagtc cttcagcgca aggccgacct 1620 
tcccaaagct cgagcccgcc ggttacaatc cccggaacaa gaatcaccgg atgaaacgcc 16 80 



<210> 12 

<211> 264 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (39) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (175) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (241) 
<223> n=unknown 

<400> 12 

ccaagaactc gatgattact tcaacactcc tggggttgng acccgggtcc ctcactttgg 60 
ttccaccaac tctcttctct catctcaatc ctcgtctcaa gcatatcacc ggatacatgg 12 0 



cacccctggt agattcatta caaaagcttg gctacgctga tggtgagact ctgtntggag 180 
ccccttatga ctttagatat ggtctagctg ctgaaggtca cccttcacaa gtgggttcca 24 0 
ngttcctcaa agatctaaag aatt 264 



<210> 13 

<211> 273 

<212> DNA 

<213> Glycine max 

<220> 

<2 21> unsure 
<222> (12) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (33) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (252) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (265) . . (256) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (272) 
<223> n=unknown 

<400> 13 

ccaacatctg anaggggtag agtaggtatt tcnatctatt atccatattg tgatgaagaa 60 
ggaacaagaa gagggtctca agattgaggt tgctacactc acagttacag tagttgttgt 12 0 
gatgctgtca ttgctatgca catgtggggc aagcaacctc gaccctttga ttctaatacc 180 
aggtaacgga gggaaccaac tagaagcaag gttgaccaat cagtacaagc cctctacttt 240 
catctgcgat cntggtaccc tctcannaag ana 2 73 



<210> 14 

<211> 419 

<212> DNA 

<213> Glycine max 



<220> 

<221> unsure 

<222> (99) 

<223> n=unknown 

<220> 

<221> unsure 
<222> (346) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (352) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (392) 
<223> n=unknown 

<220> 

<221> unsure 
<222> (405) 
<223> n= unknown 

<220> 

<221> unsure 

<222> (418) 

<22 3> n=unknown 



agcatttcag aacactgagg tgagggatca aacactaagt 60 
acttatggtt gctactaang gtggaaataa ggcagttatt 120 
cttgtacttc ctacatttta tgaaatgggt tgaagcacca 180 
aggaccagat tggtgctcca aatatataaa ggcagttgta 240 
aggtgttccc aaggctatag cagggctatt ctcagctgag 3 00 
caggacgata gctccaggat ttttanataa cnatctgttt 3 60 
catgtaatga anatgaaccc gttcnttggg actcaacna 419 



<400> 14 

gctgcatatg attggagaat 
cggataaaaa gcaacataga 
attccacatt caatgggggt 
gctccaatgg gtggtggggg 
aacattggtg gaccattttt 
gccaaggata ttgctgttgc 
ccgcattcaa acccttgcaa 



<210> 15 

<211> 272 

<212> DNA 

<213> Glycine max 



<220> 

<221> unsure 
<222> (1) . . (272) 



<223> n=unknown 



<400> 15 

tganttgatc ntgngaagtn attctgtgta 
gtaagtgang ggtntgaaga gggaaagacg 
agcaacaggt tgcaggaaac aatggatcgg 
gccgcaggnn ggaagacaat aaacattata 
atgtttcntg tgcctgcaaa gcgatatttt 



ttanttccat gacatgaccg ttnnagatnc 60 
ctttttggtn ttngatatga ttttcgccaa 120 
ttggctgcna agttagaatc aanttataat 180 
nttcattcta tgggcggtct tttccnngan 240 
ga 2 72 



<210> 16 

<211> 237 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (1) . . (237) 
<223> n=unknown 

<400> 16 

gattttcgcc aaagcaacag gttgcaggaa 
tcaatntata atgcngcagg agggaagana 
cttttggtga aatgnttcat gtgcctgcaa 
tgggttgcaa tttgtgcgcc attccagggt 



acaatggatc ggttggctgc aaagttagaa 6 0 
ataaacatta taactcattc tatgggcggt 120 
agcgatattt ttgagaaata tgttaagaat 180 
gcaccaggaa ccatcaattc naccttt 237 



<210> 17 

<211> 244 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (1) . . (244) 
<223> n= unknown 

<400> 17 

gattttcgcc aaagcaacag gttgcaggaa 
tgcaatttat aatgctgcag gagggaagaa 
tcttttggtg aaatgtttca tgtgcctgca 
ttgggttgca atttgtgcgc cattccaggg 
aaat 



acaatggatc ggtnggctgc aaagttagaa 60 
aataaacatt ataactcatt ctatgggcgg 12 0 
aagcgatatt tttgagaaat atgttaagaa 180 
tgcaccagga accatcaatt ctaccttttt 24 0 
244 



<210> 
<211> 
<212> 



18 

263 

DNA 



<213> Glycine max 



<400> 18 

gatgaaacta aaccgtgggc gactaagctt 
gttcgttgct tcatagaaga ggtcattggt 
ggaggattgg ttgcattgta ttttgcggca 
ttgcttaagc aacacctttt tgggggtttc 
cgaaaatatt tccatgggcc gga 



gtttactcgg ttgatttatg gcaagatcaa 60 
gaaccagtct atcttgtggg caactcacta 120 
aacaaccctc atttagtgaa aggtgtcgca 180 
tgccaaatcc cataaaaagt ccaagactag 240 
263 



<210> 19 

<211> 311 

<212> DNA 

<213> Zea mays 

<220> 

<221> unsure 
<222> (1) . . (311) 
<223> n=unknown 

<400> 19 

cggacgctgg ncatgttcgg agccccctac 
cagacgtccg aggtgtactc ccgctacttc 
agcgagagga cccggaagaa ggccgtcatc 
ctcgagttcg tccggaacac tccgccggcg 
ctggtcgcgc cgacgctccc cggcgggttc 
acggacatcc t 



gacttccgct acgcgccgcc gtcccccggc 60 
aaggagctga tggagctggt cgaggccgcg 12 0 
ctcggccaca gcttcggcgg catggtcgcg 180 
tggcggcgcg agcacatcga gcgcctcgtc 240 
ctggagccgg tgcgcaactt cgcgtccggg 300 
311 



<210> 20 
<211> 1155 
<212> DNA 
<213> Zea mays 

<400> 20 

tcgacccacg cgtccggcca caagaaccct 
agagccgcac tggaagacat gggataccga 
gacttccgct acgcgccgcc gtcccccggc 
aaggagctga tggagctggt cgaggccgca 
ctcggccaca gcttcggcgg catggtcgcg 
tggcggcgcg agcacatcga gcgcctcgtc 
ctggagccgg tgcgcaactt cgcgtccggg 
ccgctggcca cgcgagccat gtgragragc 
ccggccgtgt tcgggcgcct gcaggcgccg 
gcgtccgcgc acgacatgga gcgcttcctc 
cccttcagga gacgggccgt ccccaagatg 
acgtacatca gcggggtcgg caacaggacg 
ttcgacgcgg ccccggaggt ggcggcgtac 



ctcaagtcag actggtgcct cggaaagctg 60 
gacggagaca ccatgttcgg agccccctac 120 
cagacgtccg aggtgtactc ccgctacttc 180 
agcgagagga cccggaagaa ggccgtcatc 240 
ctcgagttcg tccggaacac tccgccggcg 3 00 
ctggtcgcgc cgacgctccc cggcgggttc 360 
acggacatcc tmtacgtgcc agcgacgacg 420 
ttcgagagcg ccatcgtgaa cttcccgtcg 4 80 
ctcgtggtca ccagggagcg gaactactcc 54 0 
gccgccgtcg gctccggcga ggccgcggag 600 
ggcagcttcg cggcgccgat ggtgcccatg 660 
ccgctgcggc tggtgttctg gggcgacgac 720 
ggggacggag atggcaagat caatttgatc 780 



agcgtcttgg cgtttgagaa ggagatgcgt cggcagccgg agcagaagaa gcagttcaaa 840 
tccatcaaga tcgataaggc ccagcattct acgatcgtca cggatgattt tgccctgcac 900 
agggtcattc aagaaattgt tgaggccaat aatcagaaga ttccatccta aattcttcat 960 
gtcatgtatg cattaccgag ctgtgggggc caatagtggg ttgggagttg ggacatcggt 1020 
tccgtgctta aaacggtcgt ggtgtggtct caattcaatc gattagttat ttgttaacgt 108 0 
caattgcttg cctcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 114 0 
aaaaaaaaar gggcg 1155 



<210> 21 
<211> 328 
<212> DNA 
<213> Zea mays 

<400> 21 

gttggaatgc tcttcaactt tcttacgctc 
ctggcttaaa tgcatgctgc ttgaacccta 
aaggcctgat agtggtcttc tgcaattaca 
ctctcttcag tatggaaaga atgggtcaaa 
gcaattatcg ctgttccgta tgattggaga 
ctgtactttc acaattaaac aggatcag 



atttacattg cttttctctg cggtaaattg 60 
taatcagata gaccatcccg aatgcaagtc 120 
gagctggacc ctggttatat aacaggtcct 180 
tggtgtgtag agtttggcat tgaagctaat 240 
ctgcccccat caatgcttga ggagagagat 3 00 
328 



<210> 22 
<211> 356 
<212> DNA 
<213> Zea mays 

<400> 22 

gtctttctgc aattacagag ctggaccctg 
tcagtatgga aagaatgggt caaatggtgt 
atcgctgttc cgtatgattg gagactgccc 
tttcacaaat taaagtttgt aacacttgcc 
catctgttag gatcagtaag agttaatggc 
cagatcccac aagcatacgg ttaccgccaa 



gttatataac aggtttcagg tcctctctct 60 
gtagagtttg gcattgaagc taatgcaatt 120 
ccatcaatgc ttgaggagag agatctgtac 180 
tcaacttgtt atgaagcaac caatgctata 240 
ccatgacgga ttcaggttcc tgctcaccaa 300 
tgcctgcagt tggacagtac caaccc 3 56 



<210> 23 
<211> 1552 
<212> DNA 
<213> Zea mays 

<400> 23 

tcgacccacg cgtccgcaga catgatcatt ggtgatgaca ctgtgtacta ctatcatgac 60 
atgatagtgg aaatgattaa atggggatat caagaaggaa aaactctctt tggatttggt 120 
tatgatttcc gtcaaagcaa caggctctca gagacacttg acagattttc taaaaagctg 180 
gagtcagtgt acacagcttc tggtggaaag aagatcaatc tcattactca ttcaatgggg 240 
ggattacttg tgaaatgttt catctcactg cacagtgata tatttgaaaa atatgtcaar 300 



agttggatcg caattgctgc accattccar ggtgcccctg ggtamataac taccaktytg 3 60 
ctgaatggaa tgtcttttgt craaggatgg gaaycaagat tctttatttc caaawkgkgt 420 
atgcascaat tgctacttga gtgcccatca atctatgagk tgctgscaam ccctaacttt 4 80 
ccagtggaga gacatcccac tgctacagat ttggagagag aatttggata mcagtggcaa 540 
gaaaagtgcc ctgttagagt cgtatgagcc tgaggaagca ataaagatga ttaaagaggc 600 
tctttccagt aatgagatca ttgctgatgg catgcatatt ccggtgcccc ttaatttgga 660 
tatattgaat tgggcaaaga aacttatgat cttttatgca gtacaaagct tccggaatca 720 
gtgaaattct acaacattta tgggattgat tatgatactc cacatactgt ctgctatggc 780 
agtgaacagc agccggtttc aagtcttagt agcctcttat atgctcaggg aaaatacgtc 840 
tatgttgatg gcgacggatc tgttcccgca gaatcagcaa aggctgacgg atttaatgca 900 
gtggcaaggg ttggggttgc tcctgaccac cggggaatcg tgtgcagtcg ccgcgcgttc 960 
cggatcgtcc agcactggct gcacgccgga gaacctgacc cgttctacga cccgctgagc 102 0 
gactatgtca tactcccaac acgcttacga aatcgagaag catcgtgaga aacacgggga 108 0 
tgtcacgtca gtagcggagg actgggagat catctcccct aacgacggca agaccatrrg 114 0 
gccaggcgag cttcctccta tggtcagcac actgaccacg agccgggaag gcaaggaggg 1200 
agcactggaa gaggcgcatg ccaccgtggt cgttcacccg gagaagaagg gacggcagca 1260 
tgtgcaagtt agggctgtgg gtgtcagcca tggtggctaa agccgtagga gccacgttgg 132 0 
ttgtctactc tatctagcag tagcagctat acctctgtgc acgcactgta aaattggatg 1380 
tacatatatg gctatgacct ctgtagggat ctggttttag aagtataaat gggcaccctg 144 0 
cctgcttgta aatgttcaga accgaaaaca caggccctgt tctttttttt cctttttaaa 1500 
aaaaataaaa agatggtaaa ggattccatt aaaaaaaaaa aaaaaaaagg eg 1552 



<210> 24 
<211> 227 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 
<222> (1) . . (227) 
<223> n=unknown 

<400> 24 

ttggttatga tttccgtcaa agcaacaggc 
agctggagtc agtgtacaca gcttctggtg 
tggggggatt acttgtgaaa tgtntcatct 
tcaagagttg gntegcaatt gcngcaccat 



tctcagagac acttgacaga ttttctaaaa 60 
gaaagaagat caatctcatt actcattcaa 120 
cactgcacag tgatatatnt gaaaaatatg 180 
tccaaggtgc ccctggg 227 



<210> 25 
<211> 1587 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 
<222> (1) . . (1587) 



<223> n=unknown 



<400> 25 

ggagattgtc gtgccggagg acgaccacgg 
ctggtttgta gaactcgacc cacgcgtccg 
gacgaccacg gcctgtttgc catcgacatt 
catctgtcta tggtgtatca cttccatgat 
gagaaaggga ccacactatt tggatatggt 
aaagcgatgg ctggtttgag agcaaaactt 
aaagttaatt taatctcaca ttctatgggt 
aatcatgatg tattcactaa gtatgtcaac 
ggtgcccccg gatgcatcaa tgattctcta 
gagagcttct ttttcgtatc tagatgggca 
atctatgaaa tgttaccaaa tccagaattc 
tggcgtaaga accctgaaaa ggatggaact 
tgtgtgtcct tgttcgaaga agctttaagg 
gtagcactac cattcaatat gtcagtcttc 
gacaatgctg aattaccaga tactgtgagc 
actccatacg atgtatgcta tggctcagaa 
tgtcacacag tgccggcata cacttatgtg 
gcacgggctg atgggttctc tgcgaaagaa 
ctgctgtccg atgagaacgt attcaagctt 
aagtcagagt ggcgttgcgt gtctaaatcc 
tagttcttca ggaagactgt tattttggcc 
tctggtgatt gtattgtatg tctgcacgat 
gattagtgcc cttgtacgat gtatcattgg 
tttntttgnc cttccactct ggacntagta 
nngtctgnnn nnnnnnnnnn nnnnnnnnrm 
nnnnnnnnnn nnaaaaaaaa awgkgaagcc 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 



cctgtttgcc atcgacattc ttgatccttc 60 
cccaccgtcc gggagattgt cgtgccggag 120 
cttgatcctt cctggtttgt agaacttctc 180 
atgattgata tgctcataaa ctgtggatat 240 
tatgattttc gccaaagcaa caggatagac 3 00 
gagacagctc ataagacctc tggagggaaa 360 
ggattgctag tacgctgctt catgtctatg 42 0 
aaatggattt gcattgcttg tccattccaa 480 
cttactggat tgcaatttgt ttatggtttt 540 
atgcaccaat tgcttgtcga atgcccatca 600 
aagtggaagg aaaaaccaat tattcaggtt 660 
SJtggagcttg ttcaatatga agcaactgat 720 
aataatgagc tcacgtataa cggaaagaaa 780 
aaatgggcca ccaagactcg ccaaatccta 840 
ttttacaata tatacgggac atcttatgaa 900 
agctctccga ttggagattt gtcagaagtg 960 
gatggagatt gcacggttcc catagaatcg 102 0 
agagttggcg tcaaggcgga ccaccgtggc 108 0 
ctcaagaaat ggctcggtgt gagcgagaag 1140 
tactccaaag tgacctaatt gggttgcctg 1200 
tttcctcctg aagagaagat gaaacaaaat 1260 
gtaaatctct gcaagctgca cggaacaagg 132 0 
caggcatttn tttttgaacc tangggcata 1380 
aagaatatnt gaatcgacct tanttnnaan 1440 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 
gntnntnntt tnaaaagnnt tttnnnaaaa 1560 
1587 



<210> 26 
<211> 300 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 
<222> (1) . . (300) 
<22 3> n=unknown 

<400> 26 

gacaaagcga tggctggttt gagagcaaaa 
aaaaaagtta atttaatctc acattctatg 
atgaatcatg atgtgagttt tcatgttttc 
atgtcaattt cccccatttt ctaggtattc 
gcttgtccat tccaaggtaa cttatgggac 



cttgagacag ctcataagac ctctggaggg 60 
ggtggattgc tagtacgctg cttcatgtct 120 
tgtgtttttt ttgcttttgc ataaatatcc 180 
actangtatg tcaacaaatg gatttgcatt 24 0 
atttcaattg tttattanat natggggncc 300 



<210> 27 
<211> 1240 
<212> DNA 
<213> Zea mays 

<220> 

<221> unsure 
<222> (1) . . (1240) 
<223> n=unknown 

<400> 27 

tcgacccacg cgtccggttc ccagttccca 
tgccatattt gtaacacaga ttactatata 
aagascaata ttgaactcat agwagsgaca 
acnactccat ggggtcnttn attttntgcn 
tgggggcagt gggtccgaac tggntgtaga 
atctttctta ggagttccta aggctgttgc 
tgccggttgc taggtataag taatgattca 
aatgaatatt attggatgct cgacaagctt 
gacctcacaa gtccagggag tgcctgcctc 
atgcaccgac ggtaccgtga ggtcatttct 
ggcaaggtca ggactgacaa gacctaccct 
aagacaaacg agaactacag gctgctttcg 
gttcaagctc tacaaggtga ggtctgttca 
cacctacgac gaccgcacca tccgctaccc 
caagatcgat ctggagacca acaagatcat 
ggtcatggtg atcggcagga ggaataccgg 
gcataagggc aacttcgaga ccatccacgt 
ttctcctatt tgttgtacag gaaaacatag 
tgtggagact tgatttcata taaagttagg 
gtttagattt agagtacact atgtatgcgt 
aatggtactt atttatatta attaattaaa 



ccgtgtagat 


ggttctggta 


taaaatgtat 


60 


caggttcgtg 


atcaaacttt 


gagcagaata 


120 


aatggtggaa 


atagggtggt 


ggkmgatccc 


180 


ttttacgnaa 


tggntcgaag 


ccctcctccg 


240 


accatataaa 


gctgtaatga 


atattggagg 


300 


tgggcttttt 


ttcttctaag 


caaaagatgt 


360 


tttatttaaa 


gcaaaaggga 


atagcaaaag 


420 


gcggagcttt 


tgctcccaag 


ccatcttctg 


480 


tgatcctcat 


catcaggaac 


aggctcaagt 


540 


atcctgatgc 


aacaccatgt 


acttgttgat 


600 


gctgggttca 


tggatgtcat 


ttccatccct 


660 


tcttcaccca 


atcagggatg 


aggatgccaa 


720 


gtttggccag 


aaagacatcc 


cctatctgaa 


780 


cgacccgctc 


atcaaggcca 


acgacaccat 


840 


ggacttcatc 


atgtttgacg 


tcggcaacgt 


900 


gcgtgtagga 


gtgatcaara 


taagggagaa 


960 


gctgcttgra 


gctttttgct 


atgtctagtt 


1020 


aatgaaattc 


aaatttggtg 


gccacaaaag 


1080 


cttaacatta 


gtgcaaacag 


ttgtatttta 


1140 


tgtttgacaa 


tgcttattta 


tgatatattg 


1200 


aaaaaaaaaa 






1240 



<210> 28 
<211> 324 
<212> DNA 
<213> Zea mays 

<400> 28 

cgaatgctcc tgacatggaa atattttcca 
catatgtcta taagttggcc ccacaggcag 
cggctgaagg cggggaggaa aatagctgct 
atgaaactgt tccagttctt agtgcgggct 
ctcgtttcaa ccctgccggc agcaagactt 
ctactctcct ggaaggcagg ggca 



tgtacggagt aggcattcct actgaaaggg 60 
aatgttatat acctttccga attgacacct 120 
tgaaaggggg tgtttactta gccgatggtg 180 
acatgtgtgc aaaaggatgg cgtggcaaaa 240 
acgtgagaga atacagccat tcaccaccct 3 00 
324 



<210> 29 
<211> 254 
<212> DNA 
<213> Zea mays 

<400> 29 

gaataaagag caacattgaa 
tcccacactc catgggggtc 
ctcccatggg gggtggcggt 
atattggagg acctttctta 
ccaaagatgt tgcc 



ctcatggtag caacaaatgg 
ctctattttt tgcattttat 
ggtccagact ggtgtgagaa 
ggagttccta aggctgttgc 



tggaaatagg gtggtggtga 60 
gaaatgggtc gaagcacctc 12 0 
gcatattaaa gctgtaatga 180 
tggccttttc tcatctgaag 24 0 
254 



<210> 30 

<211> 518 

<212> DNA 

<213> Mus musculus 

<400> 30 

tggaggacaa cgcggggtct gatacgactc actataggga atttggccct cgagcagtag 60 
attcggcacg atgggcacga ggactccatc atgttcctca agctttattc ctaccgggat 12 0 
gtcaacctgt ggtgccgcca gcgaagggtc aaggccaaag ctgtctctac agggaagaag 180 
gtcagtgggg ctgctgcgag caagctgtga gctatccaga caacctgacc taccgagatc 24 0 
tcgattactt catctttgct cctactttgt gttatgaact caactttcct cggtcccccc 300 
gaatacgaga gcgctttctg ctacgacgag ttcttgagat gctctttttt acccagcttc 360 
aagtggggct gatccaacag tggatggtcc ctactatcca gaactccatg gaagcccttt 42 0 
caagagcttc tgcagttttg gagaccgcga gttctacaga gattggtgga atgctgagtc 48 0 
tgtcaccgac ttttggcaga actggaatat ccccgtgg 518 



<210> 31 

<211> 299 

<212> DNA 

<213> Mus musculus 

<400> 31 

ccatgatggc tcaggtccca ctggcctgga ttgtgggccg attcttccaa gggaactatg 60 
gcaatgcagc tgtgtgggtg acactcatca ttgggcaacc ggtggctgtc tcatgtatgt 12 0 
ccacgactac tacgtgctca actacgatgc cccagtgggt catgagctac tgccaaaggc 180 
agccctccct aacctgggcc tggagttctg gaggggttcc tggctgcctg cacactcctc 24 0 
ctagtctggg aggcctctct gcccctatgc gctactcctg ctcttgggga tggcatttg 299 



<210> 32 
<211> 1895 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Inferred cDNA 
sequence 

<220> 

<221> unsure 
<222> (1) . . (1895) 
<223> n=unknown 

<400> 32 

gtctggtgtg atggggacag ggagggactt ccccttaccc agcactggtg ttggctgagg 60 
tgggtgctga gtctcagagc ttggcatgga gaccagacag ggctgggtct gcaagcctga 12 0 
ggctgccgcc ctgagctcgg gctgggacgt gcccagaggt gttgggagga tctggggtga 180 
gtaccctgtg gccaggacta aaggggctnc accctcctgt ccatccctcg cagatcttga 240 
gcaatgcccg gttatttctg gagaacctca tcaagtatgg catcctggtg gaccccatcc 300 
aggtggtttc tctgttcctg aaggatccct atagctggcc cgccccatgc ctggttattg 360 
cggccaatgt ctttgctgtg gctgcattcc aggttgagaa gcgcctggcg gtgggtgccc 42 0 
tgacggagca ggcgggactg ctgctgcacg tggccaacct ggccaccatt ctgtgtttcc 480 
cagcggctgt ggtcttactg gttgagtcta tcactccagt gggctccctg ctggcgctga 54 0 
tggcgcacac catcctcttc ctcaagctct tctcctaccg cgacgtcaac tcatggtgcc 600 
gcagggccag ggccaaggct gcctctgcag ggaagaaggc cagcagtgct gctgccccgc 660 
acaccgtgag ctacccggac aatctgacct accgcgatct ctactacttc ctcttcgccc 720 
ccaccttgtg ctacgagctc aactttcccc gctctccccg catccggaag cgctttctgc 780 
tgcgacggat ccttgagatg ctgttcttca cccagctcca ggtggggctg atccagcagt 840 
ggatggtccc caccatccag aactccatga agcccttcaa ggacatggac tactcacgca 900 
tcatcgagcg cctcctgaag ctggcggtcc ccaatcacct catctggctc atcttcttct 960 
actggctctt ccactcctgc ctgaatgccg tggctgagct catgcagttt ggagaccggg 1020 
agttctaccg ggactggtgg aactccgagt ctgtcaccta cttctggcag aactggaaca 1080 
tccctgtgca caagtggtgc atcagacact tctacaagcc catgcttcga cggggcagca 1140 
gcaagtggat ggccaggaca ggggtgttcc tggcctcggc cttcttccac gagtacctgg 12 00 
tgagcgtccc tctgcgaatg ttccgcctct gggcgttcac gggcatgatg gctcagatcc 12 60 
cactggcctg gttcgtgggc cgctttttcc agggcaacta tggcaacgca gctgtgtggc 1320 
tgtcgctcat catcggacag ccaatagccg tcctcatgta cgtccacgac tactacgtgc 1380 
tcaactatga ggccccagcg gcagaggcct gagctgcacc tgagggcctg gcttctcact 144 0 
gccacctcac acccgctgcc agagcccacc tctcctccta ggcctcgagt gctggggatg 15 00 
ggcctggctg cacagcatcc tcctctggtc ccagggaggc ctctctgccc ctatggggct 15 60 
ctgtcctgca cccctcaggg atggcgacag caggccagac acagtctgat gccagctggg 162 0 
agtcttgctg accctgcccc gggtccgagg gtgtcaataa agtgctgtcc agtgacctct 1680 
tcagcctgcc aggggcctgg ggcctggtgg ggggtatggc cacacccaca agggcgagtg 174 0 
ccagagctgt gtggacagct gtcccaggac ctgccgggga gcagcagctc cactgcagca 1800 
gggcgggcat ggccggtagg gggagtgcaa ggccaggcag acgcccccat tccccacact 1860 
cccctaccta gaaaagctca gctcaggcgt cctct 18 95 



<210> 33 
<211> 1766 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inferred cDNA 
sequence 

<400> 33 

cacgactggg ccgcgacgtg gtgcgggccg aagccatggg cgaccgcgga ggcgcgggaa 60 
gctctcggcg tcggaggacc ggctcgcggg tttccatcca gggtggtagt gggcccatgg 12 0 
tagacgaaga ggaggtgcga gacgccgctg tgggccccga cttgggcgcc gggggtgacg 180 
ctccggctcc ggctccggtt ccggctccag cccacacccg ggacaaagac cggcagacca 240 
gcgtgggcga cggccactgg gagctgaggt gccatcgtct gcaagactct ttgttcagct 300 
cagacagcgg tttcagcaat taccgtggta tcctgaattg gtgcgtggtg atgctgatcc 360 
tgagtaatgc aaggttattt ttagagaatc ttatcaagta tggcatcctg gtggatccca 42 0 
tccaggtggt gtctctgttt ctgaaggacc cctacagctg gcctgcccca tgcttgatca 480 
ttgcatccaa tatctttatt gtggctacat ttcagattga gaagcgcctg tcagtgggtg 540 
ccctgacaga gcagatgggg ctgctgctac atgtggttaa cctggccaca attatctgct 600 
tcccagcagc tgtggcctta ctggttgagt ctatcactcc agtgggttcc ctgtttgctc 660 
tggcatcata ctccatcatc ttcctcaagc ttttctccta ccgggatgtc aatctgtggt 720 
gccgccagcg aagggtcaag gccaaagctg tgtctgcagg gaagaaggtc agtggggctg 78 0 
ctgcccagaa cactgtaagc tatccggaca acctgaccta ccgagatctc tattacttca 84 0 
tctttgctcc tactttgtgt tatgaactca actttcctcg atccccccga atacgaaagc 900 
gctttctgct acggcgggtt cttgagatgc tctttttcac ccagcttcaa gtggggctga 960 
tccagcagtg gatggtccct actatccaga actccatgaa gcccttcaag gacatggact 102 0 
attcacgaat cattgagcgt ctcttaaagc tggcggtccc caaccatctg atatggctca 108 0 
tcttcttcta ttggcttttc cactcatgtc tcaatgctgt ggcagagctc ctgcagtttg 114 0 
gagaccgcga gttctacagg gactggtgga atgctgagtc tgtcacctac ttttggcaga 12 00 
actggaatat ccccgtgcac aagtggtgca tcagacactt ctacaagcct atgctcagac 1260 
tgggcagcaa caaatggatg gccaggactg gggtcttttt ggcgtcagcc ttcttccatg 132 0 
agtacctagt gagcattccc ctgaggatgt tccgcctctg ggcattcaca gccatgatgg 1380 
ctcaggtccc actggcctgg attgtgaacc gcttcttcca agggaactat ggcaatgcag 1440 
ctgtgtgggt gacactcatc attgggcaac cggtggctgt gctcatgtat gtccacgact 1500 
actacgtgct caactatgat gccccagtgg gggcctgagc tactgccaaa ggccagccct 1560 
ccctaacctg ggcctggagt tctggagggc ttcctggctg cctgcacact cctcctagtc 162 0 
tgggaggcct ctctgcccct atggggccta ctcctgctct tggggatggc acctgagtcc 1680 
agctggtatg agccagtgct gggagtctgt gctgaccagg ggctgaggat atcaataaag 174 0 
agctatctaa aaaaaaaaaa aaaaaa 1766 



<210> 34 
<211> 409 
<212> PRT 

<213> Homo sapiens 



<400> 34 

Arg Arg Ser Leu Leu Asp Glu Leu Leu Glu Val 
15 10 



Asp His lie Arg Thr 
15 



lie Tyr His Met Phe lie Ala Leu Leu lie Leu Phe lie Leu Ser Thr 
20 25 30 



Leu Val Val Asp Tyr lie Asp Glu Gly Arg Leu Val Leu Glu Phe Ser 



Leu Leu Ser Tyr Ala Phe Gly Lys Phe Pro Thr Val Val Trp Thr Trp 
50 55 60 

Trp lie Met Phe Leu Ser Thr Phe Ser Val Pro Tyr Phe Leu Phe Gin 
65 70 75 80 

His Trp Arg Thr Gly Tyr Ser Lys Ser Ser His Pro Leu lie Arg Ser 



Leu Phe His Gly Phe Leu Phe Met lie Phe Gin lie Gly Val Leu Gly 
100 105 110 

Phe Gly Pro Thr Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser 
115 120 125 

Arg Phe He He He Phe Glu Gin He Arg Phe Val Met Lys Ala His 
130 135 140 

Ser Phe Val Arg Glu Asn Val Pro Arg Val Leu Asn Ser Ala Lys Glu 
145 150 155 160 

Lys Ser Ser Thr Val Pro He Pro Thr Val Asn Gin Tyr Leu Tyr Phe 
165 170 175 

Leu Phe Ala Pro Thr Leu He Tyr Arg Asp Ser Tyr Pro Arg Asn Pro 
180 185 190 

Thr Val Arg Trp Gly Tyr Val Ala Met Lys Phe Ala Gin Val Phe Gly 
195 200 205 

Cys Phe Phe Tyr Val Tyr Tyr He Phe Glu Arg Leu Cys Ala Pro Leu 
210 215 220 

Phe Arg Asn He Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val Leu 
225 230 235 240 

Cys Val Phe Asn Ser He Leu Pro Gly Val Leu He Leu Phe Leu Thr 
245 250 255 



Phe Phe Ala Phe Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu 
260 265 270 



Arg Phe Gly Asp Arg Met Phe Tyr 
275 280 

Tyr Ser Asn Tyr Tyr Arg Thr Trp 
290 295 

Tyr Tyr Tyr Ala Tyr Lys Asp Phe 
305 310 

Lys Ser Ala Ala Met Leu Ala Val 
325 

Glu Tyr Ala Leu Ala Val Cys Leu 
340 

Val Leu Phe Met Phe Phe Gly Met 
355 360 

Ser Arg Lys Lys Pro lie Trp Asn 
370 375 

Leu Gly Asn Gly Val Leu Leu Cys 
385 390 

Arg Arg His Cys Pro Leu Lys Asn 
405 



Lys Asp Trp Trp Asn Ser Thr Ser 
285 

Asn Val Val Val His Asp Trp Leu 
300 

Leu Trp Phe Phe Ser Lys Arg Phe 
315 320 

Phe Ala Val Ser Ala Val Val His 
330 335 

Ser Phe Phe Tyr Pro Val Leu Phe 
345 350 

Ala Phe Asn Phe lie Val Asn Asp 
365 

Val Leu Met Trp Thr Ser Leu Phe 
380 

Phe Tyr Ser Gin Glu Trp Tyr Ala 
395 400 

Pro 



<210> 35 
<211> 409 
<212> PRT 

<213> Mus nvusculus 
<400> 35 

Arg Gin Ser Leu Leu Asp Glu Leu Phe Glu Val Asp His lie Arg Thr 



He Tyr His Met Phe He Ala Leu Leu He Leu Phe Val Leu Ser Thr 



lie Val Val Asp Tyr lie Asp Glu Gly Arg Leu Val Leu Glu Phe Asn 



Leu Leu Ala Tyr Ala Phe Gly Lys Phe Pro Thr Val He Trp Thr Trp 
50 55 60 



Trp Ala Met Phe Leu Ser Thr Leu Ser He Pro Tyr Phe Leu Phe Gin 



65 70 75 80 

Pro Trp Ala His Gly Tyr Ser Lys Ser Ser His Pro Leu lie Tyr Ser 
85 90 95 

Leu Val His Gly Leu Leu Phe Leu Val Phe Gin Leu Gly Val Leu Gly 
100 105 110 

Phe Val Pro Thr Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser 
115 120 125 

Arg Phe lie Leu lie Leu Glu Gin lie Arg Leu lie Met Lys Ala His 
130 135 140 

Ser Phe Val Arg Glu Asn lie Pro Arg Val Leu Asn Ala Ala Lys Glu 
145 150 155 160 

Lys Ser Ser Lys Asp Pro Leu Pro Thr Val Asn Gin Tyr Leu Tyr Phe 
165 170 175 

Leu Phe Ala Pro Thr Leu lie Tyr Arg Asp Asn Tyr Pro Arg Thr Pro 
180 185 190 

Thr Val Arg Trp Gly Tyr Val Ala Met Gin Phe Leu Gin Val Phe Gly 
195 200 205 

Cys Leu Phe Tyr Val Tyr Tyr He Phe Glu Arg Leu Cys Ala Pro Leu 
210 215 220 

Phe Arg Asn He Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val Leu 
225 230 235 240 

Cys Val Phe Asn Ser He Leu Pro Gly Val Leu He Leu Phe Leu Ser 
245 250 255 

Phe Phe Ala Phe Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu 
260 265 270 

Arg Phe Gly Asp Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser 
275 280 285 

Tyr Ser Asn Tyr Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu 
290 295 300 

Tyr Tyr Tyr Val Tyr Lys Asp Leu Leu Trp Phe Phe Ser Lys Arg Phe 
305 310 315 320 



Lys Ser Ala Ala Met Leu Ala Val Phe Ala Leu Ser Ala Val Val His 



325 



330 



335 



Glu Tyr Ala Leu 
340 

Val Leu Phe Met 
355 

Ser Arg Lys Arg 
370 

Leu Gly Tyr Gly 
385 

Arg Gin His Cys 



Ala lie Cys Leu 



Phe Phe Gly Met 
3S0 

Pro lie Trp Asn 
375 

Leu lie Leu Cys 
390 

Pro Leu Lys Asn 
405 



Ser Tyr Phe Tyr 
345 

Ala Phe Asn Phe 



He Met Val Trp 
380 

Phe Tyr Ser Gin 
395 

Pro 



Pro Val Leu Phe 
350 

lie Val Asn Asp 
365 

Ala Ser Leu Phe 



Glu Trp Tyr Ala 
400 



<210> 36 
<211> 429 
<212> PRT 

<213> Saccharomyces 
<400> 36 

Asp Lys Ala Asp Ala 
1 5 

Gly He Tyr Val Phe 
20 

Cys Cys Thr Asp Tyr 
35 

Glu He Val Gin Tyr 
50 

Asp Leu Ala Met Phe 
65 

Leu Val Lys Lys Arg 
85 

Val Ser He Phe Glu 
100 

Val Tyr Tyr Phe Asp 
115 



cerevisiae 



Pro Pro Gly Glu Lys Leu 
10 

Ala Trp Met Phe Leu Gly 
25 

Tyr Ala Ser Tyr Gly Ser 
40 

Met Thr Thr Asp Leu Phe 
55 

Leu Cys Thr Phe Phe Val 
70 75 

He He Asn Trp Lys Trp 
90 

Leu Ala Phe He Pro Val 
105 

Phe Asn Trp Val Thr Arg 
12 0 



Glu Ser Asn Phe Ser 
15 

Trp He Ala He Arg 
30 

Ala Trp Asn Lys Leu 

45 

Thr He Ala Met Leu 
60 

Val Phe Val His Trp 
80 

Thr Gly Phe Val Ala 
95 

Thr Phe Pro lie Tyr 
110 

He Phe Leu Phe Leu 
125 



His Ser Val Val Phe 
130 

Gly Tyr Leu Trp Asp 
145 

Leu Gin Lys Tyr Lys 
165 

Gin Lys Ser Cys Asp 
180 

Asp Asn Asp Phe Pro 
195 

Cys Leu Phe Pro Val 
210 

Arg lie Arg Trp Arg 
225 

Thr lie Phe Leu Met 
245 

Ala Met Arg Cys lie 
260 

lie Pro Ala Thr Gin 
275 

Gly Phe Thr Val Leu 
290 

Leu Leu Asn Cys Val 
305 

Tyr Gly Asp Trp Trp 
325 

Trp Asn Val Pro Val 
340 

Ser Met Gly Ala Leu 
355 

Phe Phe Leu Ser Ala 
370 



Val Met Lys Ser His Ser 
135 

lie Lys Gin Glu Leu Glu 
150 155 

Glu Ser Leu Ser Pro Glu 
170 

Phe Cys Leu Phe Glu Leu 
185 

Asn Asn lie Ser Cys Ser 
200 

Leu Val Tyr Gin lie Asn 
215 

Tyr Val Leu Glu Lys Val 
230 235 

Met Val Thr Ala Gin Phe 
250 

Gin Phe His Asn Thr Pro 
265 

Glu Trp Phe His Leu Leu 
280 

Tyr Met Leu Thr Phe Tyr 
295 

Ala Glu Leu Thr Arg Phe 
310 315 

Asn Cys Val Ser Phe Glu 
330 

His Lys Phe Leu Leu Arg 
345 

His Leu Ser Lys Ser Gin 
360 

Val Phe His Glu Met Ala 
375 



Phe Ala Phe Tyr Asn 
140 

Tyr Ser Ser Lys Gin 
160 

Thr Arg Glu lie Leu 
175 

Asn Tyr Gin Thr Lys 
190 

Asn Phe Phe Met Phe 
205 

Tyr Pro Arg Thr Ser 
220 

Cys Ala He He Gly 
240 

Phe Met His Pro Val 
255 

Thr Phe Gly Gly Trp 
270 

Phe Asp Met He Pro 
285 

Met He Trp Asp Ala 
300 

Ala Asp Arg Tyr Phe 
320 

Glu Phe Ser Arg He 
335 

His Val Tyr His Ser 

350 

Ala Thr Leu Phe Thr 
365 

Met Phe Ala He Phe 
380 



Arg Arg Val Arg Gly Tyr Leu Phe Met Phe Gin Leu Ser Gin Phe Val 
385 390 395 400 



Trp Thr Ala Leu Ser Asn Thr Lys Phe Leu Arg Ala Arg Pro Gin Leu 
405 410 415 

Ser Asn Val Val Phe Ser Phe Gly Val Cys Ser Gly Pro 
420 425 



<210> 37 
<211> 432 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 37 

Glu Thr Val Val Thr Val Glu Thr Thr He He Ser Ser Asn Phe Ser 



Gly Leu Tyr Val Ala Phe Trp Met Ala He Ala Phe Gly Ala Val Lys 
20 25 30 

Ala Leu He Asp Tyr Tyr Tyr Gin His Asn Gly Ser Phe Lys Asp Ser 



Glu He Leu Lys Phe Met Thr Thr Asn Leu Phe Thr Val Ala Ser Val 
50 55 60 

Asp Leu Leu Met Tyr Leu Ser Thr Tyr Phe Val Val Gly He Gin Tyr 
65 70 75 80 

Leu Cys Lys Trp Gly Val Leu Lys Trp Gly Thr Thr Gly Trp He Phe 
85 90 95 

Thr Ser He Tyr Glu Phe Leu Phe Val He Phe Tyr Met Tyr Leu Thr 
100 105 110 

Glu Asn He Leu Lys Leu His Trp Leu Ser Lys He Phe Leu Phe Leu 
115 120 125 

His Ser Leu Val Leu Leu Met Lys Met His Ser Phe Ala Phe Tyr Asn 
130 135 140 

Gly Tyr Leu Trp Gly He Lys Glu Glu Leu Gin Phe Ser Lys Ser Ala 
145 150 155 160 



Leu Ala Lys Tyr Lys Asp Ser He Asn Asp Pro Lys Val He Gly Ala 
165 170 175 



Leu Glu Lys Ser Cys Glu Phe Cys Ser Phe Glu Leu Ser Ser Gin Ser 
180 185 190 



Leu Ser Asp Gin Thr Gin Lys Phe Pro Asn Asn lie Ser Ala Lys Ser 
195 200 205 

Phe Phe Trp Phe Thr Met Phe Pro Thr Leu lie Tyr Gin lie Glu Tyr 
210 215 220 

Pro Arg Thr Lys Glu lie Arg Trp Ser Tyr Val Leu Glu Lys lie Cys 
225 230 235 240 

Ala He Phe Gly Thr He Phe Leu Met Met He Asp Ala Gin He Leu 
245 250 255 

Met Tyr Pro Val Ala Met Arg Ala Leu Ala Val Arg Asn Ser Glu Trp 
260 265 270 

Thr Gly He Leu Asp Arg Leu Leu Lys Trp Val Gly Leu Leu Val Asp 
275 280 285 

He Val Pro Gly Phe He Val Met Tyr He Leu Asp Phe Tyr Leu He 
290 295 300 

Trp Asp Ala He Leu Asn Cys Val Ala Glu Leu Thr Arg Phe Gly Asp 
305 310 315 320 

Arg Tyr Phe Tyr Gly Asp Trp Trp Asn Cys Val Ser Trp Ala Asp Phe 
325 330 335 

Ser Arg He Trp Asn He Pro Val His Lys Phe Leu Leu Arg His Val 
340 345 350 

Tyr His Ser Ser Met Ser Ser Phe Lys Leu Asn Lys Ser Gin Ala Thr 
355 360 365 

Leu Met Thr Phe Phe Leu Ser Ser Val Val His Glu Leu Ala Met Tyr 
370 375 380 

Val He Phe Lys Lys Leu Arg Phe Tyr Leu Phe Phe Phe Gin Met Leu 
385 390 395 400 

Gin Met Pro Leu Val Ala Leu Thr Asn Thr Lys Phe Met Arg Asn Arg 
405 410 415 



Thr He He Gly Asn Val He Phe Trp Leu Gly He Cys Met Gly Pro 
420 425 430 



<210> 38 

<211> 1942 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 38 

ctctcgtgaa tcctttttcc tttcttcttc ttcttctctt cagagaaaac tttgcttctc 60 
tttctataag gaaccagaca cgaatcccat tcccaccgat ttcttagctt cttccttcaa 120 
tccgctcttt ccctctccat tagattctgt ttcctctttc aatttcttct gcatgcttct 180 
cgattctctc tgacgcctct tttctcccga cgctgtttcg tcaaacgctt ttcgaaatgg 240 
cgattttgga ttctgctggc gttactacgg tgacggagaa cggtggcgga gagttcgtcg 300 
atcttgatag gcttcgtcga cggaaatcga gatcggattc ttctaacgga cttcttctct 360 
ctggttccga taataattct ccttcggatg atgttggagc tcccgccgac gttagggatc 42 0 
ggattgattc cgttgttaac gatgacgctc agggaacagc caatttggcc ggagataata 48 0 
acggtggtgg cgataataac ggtggtggaa gaggcggcgg agaaggaaga ggaaacgccg 54 0 
atgctacgtt tacgtatcga ccgtcggttc cagctcatcg gagggcgaga gagagtccac 60 0 
ttagctccga cgcaatcttc aaacagagcc atgccggatt attcaacctc tgtgtagtag 660 
ttcttattgc tgtaaacagt agactcatca tcgaaaatct tatgaagtat ggttggttga 72 0 
tcagaacgga tttctggttt agttcaagat cgctgcgaga ttggccgctt ttcatgtgtt 78 0 
gtatatccct ttcgatcttt cctttggctg cctttacggt tgagaaattg gtacttcaga 84 0 
aatacatatc agaacctgtt gtcatctttc ttcatattat tatcaccatg acagaggttt 90 0 
tgtatccagt ttacgtcacc ctaaggtgtg attctgcttt tttatcaggt gtcactttga 960 
tgctcctcac ttgcattgtg tggctaaagt tggtttctta tgotcatact agctatgaca 1020 
taagatccct agccaatgca gctgataagg ccaatcctga agtctcctac tacgttagct 10 80 
tgaagagctt ggcatatttc atggtcgctc ccacattgtg ttatcagcca agttatccac 1140 
gttctgcatg tatacggaag ggttgggtgg ctcgtcaatt tgcaaaactg gtcatattca 1200 
ccggattcat gggatttata atagaacaat atataaatcc tattgtcagg aactcaaagc 12 60 
atcctttgaa aggcgatctt ctatatgcta ttgaaagagt gttgaagctt tcagttccaa 1320 
atttatatgt gtggctctgc atgttctact gcttcttcca cctttggtta aacatattgg 13 80 
cagagcttct ctgcttcggg gatcgtgaat tctacaaaga ttggtggaat gcaaaaagtg 144 0 
tgggagatta ctggagaatg tggaatatgc ctgttcataa atggatggtt cgacatatat 1500 
acttcccgtg cttgcgcagc aagataccaa agacactcgc cattatcatt gctttcctag 1560 
tctctgcagt ctttcatgag ctatgcatcg cagttccttg tcgtctcttc aagctatggg 1620 
cttttcttgg gattatgttt caggtgcctt tggtcttcat cacaaactat ctacaggaaa 1680 
ggtttggctc aacggtgggg aacatgatct tctggttcat cttctgcatt ttcggacaac 174 0 
cgatgtgtgt gcttctttat taccacgacc tgatgaaccg aaaaggatcg atgtcatgaa 1800 
acaactgttc aaaaaatgac tttcttcaaa catctatggc ctcgttggat ctccgttgat 186 0 
gttgtggtgg ttctgatgct aaaacgacaa atagtgttat aaccattgaa gaagaaaaga 192 0 
caattagagt tgttgtatcg ca 1942 



<210> 39 
<211> 520 



<212> PRT 

<213> Arabidopsis thaliana 



<400> 39 

Met Ala lie Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly 



Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 



Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 



Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg lie Asp 



Ser Val Val Asn Asp Asp Ala Gin Gly Thr Ala Asn Leu Ala Gly Asp 
65 70 75 80 

Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 



Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 
100 105 110 

Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala lie Phe 
115 120 125 

Lys Gin Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu lie 
130 135 140 

Ala Val Asn Ser Arg Leu He He Glu Asn Leu Met Lys Tyr Gly Trp 
145 150 155 160 

Leu He Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp 
165 170 175 

Pro Leu Phe Met Cys Cys He Ser Leu Ser He Phe Pro Leu Ala Ala 
180 185 190 

Phe Thr Val Glu Lys Leu Val Leu Gin Lys Tyr He Ser Glu Pro Val 
195 200 205 

Val He Phe Leu His He He He Thr Met Thr Glu Val Leu Tyr Pro 
210 215 220 



Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr 
225 230 235 240 



Leu Met Leu Leu Thr Cys lie Val Trp Leu Lys Leu Val Ser Tyr Ala 
245 250 255 



His Thr Ser Tyr Asp 
260 

Asn Pro Glu Val Ser 
275 

Met Val Ala Pro Thr 
290 

Cys lie Arg Lys Gly 
305 

Phe Thr Gly Phe Met 
325 

Val Arg Asn Ser Lys 
340 

Glu Arg Val Leu Lys 
355 

Met Phe Tyr Cys Phe 
370 

Leu Cys Phe Gly Asp 
385 

Ser Val Gly Asp Tyr 
405 

Met Val Arg His He 
420 

Thr Leu Ala He He 
435 

Leu Cys He Ala Val 
450 

Gly He Met Phe Gin 
465 

Glu Arg Phe Gly Ser 
485 



He Arg Ser Leu Ala Asn 
265 

Tyr Tyr Val Ser Leu Lys 
280 

Leu Cys Tyr Gin Pro Ser 
295 

Trp Val Ala Arg Gin Phe 
310 315 

Gly Phe He He Glu Gin 
330 

His Pro Leu Lys Gly Asp 
345 

Leu Ser Val Pro Asn Leu 
360 

Phe His Leu Trp Leu Asn 
375 

Arg Glu Phe Tyr Lys Asp 
390 395 

Trp Arg Met Trp Asn Met 
410 

Tyr Phe Pro Cys Leu Arg 
425 

He Ala Phe Leu Val Ser 
440 

Pro Cys Arg Leu Phe Lys 
455 

Val Pro Leu Val Phe He 
470 475 

Thr Val Gly Asn Met He 
490 



Ala Ala Asp Lys Ala 
270 

Ser Leu Ala Tyr Phe 
285 

Tyr Pro Arg Ser Ala 
300 

Ala Lys Leu Val He 
320 

Tyr lie Asn Pro He 
335 

Leu Leu Tyr Ala He 
350 

Tyr Val Trp Leu Cys 
365 

He Leu Ala Glu Leu 
380 

Trp Trp Asn Ala Lys 
400 

Pro Val His Lys Trp 
415 

Ser Lys He Pro Lys 
430 

Ala Val Phe His Glu 
445 

Leu Trp Ala Phe Leu 
460 

Thr Asn Tyr Leu Gin 
480 

Phe Trp Phe He Phe 
495 



Cys lie Phe Gly Gin Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 
500 505 510 



Met Asn Arg Lys Gly Ser Met Ser 
515 520 



<210> 40 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 40 

tgcaaattga cgagcacacc aaccccttc 2 9 

<210> 41 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonuclotide primer 

<400> 41 

aaggatgctt tgagttcctg acaatagg 2 8 



<210> 42 
<211> 1942 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 42 

ctctcgtgaa tcctttttcc tttcttcttc ttcttctctt cagagaaaac tttgcttctc 60 
tttctataag gaaccagaca cgaatcccat tcccaccgat ttcttagctt cttccttcaa 120 
tccgctcttt ccctctccat tagattctgt ttcctctttc aatttcttct gcatgcttct 180 
cgattctctc tgacgcctct tttctcccga cgctgtttcg tcaaacgctt ttcgaaatgg 240 
cgattttgga ttctgctggc gttactacgg tgacggagaa cggtggcgga gagttcgtcg 3 00 
atcttgatag gcttcgtcga cggaaatcga gatcggattc ttctaacgga cttcttctct 360 
ctggttccga taataattct ccttcggatg atgttggagc tcccgccgac gttagggatc 420 
ggattgattc cgttgttaac gatgacgctc agggaacagc caatttggcc ggagataata 480 



acggtggtgg cgataataac ggtggtggaa 
atgctacgtt tacgtatcga ccgtcggttc 
ttagctccga cgcaatcttc aaacagagcc 
ttcttattgc tgtaaacagt agactcatca 
tcagaacgga tttctggttt agttcaagat 
gtatatccct ttcgatcttt cctttggctg 
aatacatatc agaacctgtt gtcatctttc 
tgtatccagt ttacgtcacc ctaaggtgtg 
tgctcctcac ttgcattgtg tggctaaagt 
taagatccct agccaatgca gctgataagg 
tgaagagctt ggcatatttc atggtcgctc 
gttctgcatg tatacggaag ggttgggtgg 
ccggattcat gggatttata atagaacaat 
atcctttgaa aggcgatctt ctatatgcta 
atttatatgt gtggctctgc atgttctact 
cagagcttct ctgcttcggg gatcgtgaat 
tgggagatta ctggagaatg tggaatatgc 
acttcccgtg cttgcgcagc aagataccaa 
tctctgcagt ctttcatgag ctatgcatcg 
cttttcttgg gattatgttt caggtgcctt 
ggtttggctc aacggtgggg aacatgatct 
cgatgtgtgt gcttctttat taccacgacc 
acaactgttc aaaaaatgac tttcttcaaa 
gttgtggtgg ttctgatgct aaaacgacaa 
caattagagt tgttgtatcg ca 



gaggcggcgg agaaggaaga ggaaacgccg 54 0 
cagctcatcg gagggcgaga gagagtccac 600 
atgccggatt attcaacctc tgtgtagtag 660 
tcgaaaatct tatgaagtat ggttggttga 720 
cgctgcgaga ttggccgctt ttcatgtgtt 7 80 
cctttacggt tgagaaattg gtacttcaga 840 
ttcatattat tatcaccatg acagaggttt 900 
attctgcttt tttatcaggt gtcactttga 960 
tggtttctta tgctcatact agctatgaca 1020 
ccaatcctga agtctcctac tacgttagct 1080 
ccacattgtg ttatcagcca agttatccac 1140 
ctcgtcaatt tgcaaaactg gtcatattca 1200 
atataaatcc tattgtcagg aactcaaagc 12 60 
ttgaaagagt gttgaagctt tcagttccaa 1320 
gcttcttcca cctttggtta aacatattgg 1380 
tctacaaaga ttggtggaat gcaaaaagtg 1440 
ctgttcataa atggatggtt cgacatatat 1500 
agacactcgc cattatcatt gctttcctag 1560 
cagttccttg tcgtctcttc aagctatggg 1620 
tggtcttcat cacaaactat ctacaggaaa 1680 
tctggttcat cttctgcatt ttcggacaac 1740 
tgatgaaccg aaaaggatcg atgtcatgaa 18 00 
catctatggc ctcgttggat ctccgttgat 1860 
atagtgttat aaccattgaa gaagaaaaga 1920 
1942 



<210> 43 

<211> 234 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (1) . . (234) 
<223> n=unknown 

<400> 43 

gtaagcttca agagcttagc atanttcctg 
ctatcctcgc acaccttata ttcgaaaggg 
atatttacag gagttatggg atttataata 
tcacagcatc ctctcaaggg aaaccttctt 



gttgccccta ncattatgtt accagccaan 60 
ttggctgttt cgccaacttg tcaactgata 120 
gaacaataca ttaatcccat tgtacaaaat 180 
tacgccatcg agagagttct gaag 234 



<210> 44 

<211> 267 

<212> DNA 

<213> Glycine max 



<400> 44 

ctgcttttgt atctggtgtc acgttgatgc 
tgtcatatgc acatacaaac tatgatatga 
aaacattacc caatactttg atatggagta 
atacttcatg gttgctccta cattatgcta 
tcgaaagggt tgggtgtttc gtcaact 



tattaacttg cattgtgtgg ttaaaattgg 60 
gagcacttac tgtttcgaat gaaaagggag 12 0 
tccgtacact gtgaccttca ggagtttggc 180 
tcagacaagc tatcctcgca caccttcagt 240 
267 



<210> 45 

<211> 275 

<212> DNA 

<213> Glycine max 

<220> 

<221> unsure 
<222> (1) . . (275) 
<223> n=unknown 

<400> 45 

gtggaatgcc aaaactgttg aagattattg 
gatgatccgc cacctatatt ttccatgttt 
tttaattgcc ttcctggttc tgctttattc 
catattcaag tngtgggttt cngnggaatt 
attnntnggc naaaaaattc cnngaacccc 



gaggatgtgg aatatgcctg ttcacaaatg 60 
aaggcacggt ataccaaagg ccgttgctct 12 0 
catgagctgt gcatcgctgt tccttgccca 180 
nagtttcagg tnccttgggt ttcnaccnna 24 0 

ggggg 275 



<210> 46 

<211> 257 

<212> DNA 

<213> Glycine max 

<400> 46 

aacggaattg agactccaga gaatatgcca 
ggcttttgga aaaactggca tgcttccttc 
cctcttgggg gatctaagaa aaagctacta 
atctggcatg atttagagtg gaagcttctt 
atccctgagt tggtttt 



aaatgtatta ataattgtca caacttggaa 60 
aacaagtggc ttgtgaggta tatatacatt 12 0 
aatgtgtggg ttgttttcac atttgttgca 180 
tcatgggcat ggttgacgtg tttattcttc 240 
257 



<210> 47 
<211> 253 
<212> DNA 
<213> Zea mays 

<400> 47 

agaaaatgga acatgcctgt gcataaatgg attgttcgtc atatatattt tccttgcatg 60 
cgaaatggta tatcaaagga agttgctgtt tttatatcgt tcttgtttct gctgtacttc 12 0 



atgagttatg tgttgctgtt ccctgccaca tactcaagtt ctgggctttt tttaggaatc 18 0 
atgcttcaga ttcccctcat catattgaca tcatacctca aaaataaatt cagtgacaca 24 0 
atggttggca ata 253 



<210> 48 

<211> 254 

<212> DNA 

<213> Zea mays 

<400> 48 

tgaagtatgg cttattaata agatctggct 
ggccactgct aatgtgttgc cttagtctac 
aaaagttggc attcaacaat ctcattagtg 
ttacaacatt tgaaattgta tatccagtgc 
tatcaggctt tgtg 



tttggtttaa tgctacatca ttgcgagact 60 
ccatatttcc ccttggtgca tttgcagtcg 12 0 
atcctgctac tacctgtttt cacatccttt 180 
tcgtgattct taagtgtgat tctgcagttt 24 0 
254 



<210> 49 
<211> 262 
<212> DNA 
<213> Zea mays 

<400> 49 

gaagtatggc ttattaataa gatctggctt 
gccactgcta atgtgttgcc ttagtctacc 
aaagttggca ttcaacaatc tcattagtga 
tacaacattt gaaattgtat atccagtgct 
acaggctttg tgttgatgtt ta 



ttggtttaat gctacatcat tgcgagactg 60 
catatttccc cttggtgcat ttgcagtcga 120 
tcctgctact acctgttttc acatcctttt 180 
cgtgattctt aagtgtgatt ctgcagtttt 240 
262 



<210> 50 
<211> 325 
<212> DNA 
<213> Zea mays 

<220> 

<2 21> unsure 
<222> (1) . . (325) 
<223> n=unknown 

<400> 50 

taatcnaacc tcgntncngg ttcagctgta 
cacatantca natctnggca tnncngggat 
aagatatctc catgctacgt tcaagcatgt 
cagtatagtc ggacagccga tgtnnnnnna 
ggcccaggca agtagatagt ncggcagaga 
nacngagcga gcggcangaa ncagc 



tnccatgaga tatgtaatgc ggtgccgtgc 60 
catngttcag ataccgntgg nattcttgac 120 
aatggtgggc aacatgatan tttggntctn 180 
tctatactac catgacgtca tgaacaggca 240 
catgtacttc aacatcganc atcagnagca 3 00 
325 



<210> 51 
<211> 519 
<212> DNA 

<213> Mortierrella alpina 
<220> 

<221> unsure 
<222> (1) . . (519) 
<223> n=unknown 

<400> 51 

gagnnnngna acgtttagcc tnccgtagcc gccaaaatcc aagggncnac cnaccctncg 60 
ttanactnaa ttngaaaatn cnnncccaac ttnaggnact tnnagncccc ccnacttgac 12 0 
aacggagcac tatatttacc ccgtggtngt tcaacccagc catctcaccc ttgcgagcat 180 
tggtgctgct cttgataccc ttcatgctta actatctcat gatcttttac atcattttcg 240 
agtgcatctg caacgccttt gcggaactaa gttgctttgc ggatcgcaac ttttacgagg 300 
attggtggaa ctgcgtcagc tttgatgagt gggcacgcaa atggaacaag cctgtgcaac 360 
acttcttgct ccgccacgtg tacgactcga gcatccgagt ccttccactt gtccgaaatc 42 0 
caatgccgcn aattgcaaac gttccttccc ggtcgtcaat gcgttcaacg aacctgggtg 480 
aagaatgggt ggtgacaacg ttaaagtgcg cccggtatc 519 



<210> 52 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 52 

ggatccgcgg ccgcacaatg aaaaaaatat cttcacatta ttcgg 45 



<210> 53 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 



<400> 53 

ggatcccctg caggtcattc attgacggca ttaacattgg 



40 



<210> 54 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 54 

ggatccgcgg ccgcacaatg ggagcgaatt cgaaatcagt aacg 44 

<210> 55 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 55 

ggatcccctg caggttaata cccactttta tcaagctccc 40 

<210> 56 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 56 

ggatccgcgg ccgcacaatg tctctattac tggaagagat c 41 

<210> 57 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



oligonucleotide primer 



<400> 57 

ggatcccctg caggttatgc atcaacagag acacttacag c 41 

<210> 58 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 58 

ggatccgcgg ccgcacaatg ggctggattc cgtgtccgtg c 41 

<210> 59 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 59 

ggatcccctg caggttaacc agaatcaact actttgtg 3 8 



<210> 60 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 60 

tcgacctgca ggaagcttag aaatggcgat tttggattc 39 



<210> 61 
<211> 36 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide primer 

<400> 61 

ggatccgcgg ccgctcatga catcgatcct tttcgg 36 

<210> 62 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Annealed 
oligonucleotide adapter 

<400> 62 

cgcgatttaa atggcgcgcc ctgcaggcgg ccgcctgcag ggcgcgccat ttaaat 56 

<210> 63 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 63 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 

<210> 64 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 



<400> 64 

tcgacctgca ggaagcttgc ggccgcggat cc 



32 



<210> 65 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 65 

tcgacctgca ggaagcttgc ggccgcggat cc 32 

<210> 66 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 66 

tcgaggatcc gcggccgcaa gcttcctgca gg 32 

<210> 67 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 67 

tcgaggatcc gcggccgcaa gcttcctgca ggagct 36 

<210> 68 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 



<400> 68 

cctgcaggaa gcttgcggcc gcggatcc 



28 



<210> 69 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 69 

tcgacctgca ggaagcttgc ggccgcggat ccagct 3 6 

<210> 70 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 70 

ggatccgcgg ccgcaagctt cctgcagg 28 

<210> 71 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 71 

gatcacctgc aggaagcttg cggccgcgga tccaatgca 3 9 

<210> 72 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Ligating 
oligonucleotide 

<400> 72 

ttggatccgc ggccgcaagc ttcctgcagg t 31 



<210> 73 
<211> 2013 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 73 

atgcccctta ttcatcggaa aaagccgacg 
gtggtgcacg atgaggattc gcaaaagaaa 
aaatcgaacg gaggagggaa gtggtcgtgc 
gtgtgtgtaa cctggtggtt tcttctcttc 
cagtatgtaa cggagcgaat cacgggtcct 
aaagaaggtc ttaaggcgaa acatcctgtt 
ctcgagcttt gggaaggcaa acaatgcgct 
ggaacttttg gtgaagtcta caaaaggcct 
aatgaaactg ggttggatcc agctggtatt 
gctgactact ttgctcctgg ctactttgtc 
attggatatg aagagaaaaa tatgtacatg 
aacacagagg tacgtgatca gactcttagc 
tctaccaacg gtggaaaaaa agcagttata 
ctacatttta tgaagtgggt tgaggcacca 
tggtgtgcaa agtatattaa ggcggtgatg 
aaagctgttg cagggctttt ctctgctgaa 
gccccaggat tcttagacac cgatatattt 
atgacacgca catgggactc aacaatgtct 
ggcgggcttg attggtcacc ggagaaaggc 
aacgaaactt gtggtgaagc aggtgaaaac 
ggaaggatga tatcttttgg gaaagaagta 
attgattttc gaggtgctgt caaaggtcag 
tggacagagt accatgacat gggaattgct 
tacactgctg gtgaagctat agatctacta 
ggtgccgctc atttctctta tggaattgct 
cccaaatact ggtcaaatcc gttagagaca 
tactcattat acggagtggg gataccaacg 
tctcccgaca gttgcatccc ctttcagata 
agctgtctga aagcaggagt ttacaatgtg 
gccgggtaca tgtgtgcaaa agcgtggcgt 
aagacttata taagagaata caatcactct 
acgcagagtg gtgcccatgt tgatatcatg 
agggttgccg ccggaggtaa cgggtctgat 



gagaaaccat cgacgccgcc atctgaagag 60 
ccacacgaat cttccaaatc ccaccataag 12 0 
atcgattctt gttgttggtt cattgggtgt 180 
ctttacaacg caatgcctgc gagcttccct 240 
ttgcctgacc cgcccggtgt taagctcaaa 3 00 
gtcttcattc ctgggattgt caccggtggg 360 
gatggtttat ttagaaaacg tttgtggggt 42 0 
ctatgttggg tggaacacat gtcacttgac 48 0 
agagttcgag ctgtatcagg actcgtggct 54 0 
tgggcagtgc tgattgctaa ccttgcacat 60 0 
gctgcatatg actggcggct ttcgtttcag 660 
cgtatgaaaa gtaatataga gttgatggtt 72 0 
gttccgcatt ccatgggggt cttgtatttt 780 
gctcctctgg gtggcggggg tgggccagat 840 
aacattggtg gaccatttct tggtgttcca 900 
gcaaaggatg ttgcagttgc cagagcgatt 960 
agacttcaga ccttgcagca tgtaatgaga 1020 
atgttaccga agggaggtga cacgatatgg 1080 
cacacctgtt gtgggaaaaa gcaaaagaac 1140 
ggagtttcca agaaaagtcc tgttaactat 12 00 
gcagaggctg cgccatctga gattaataat 1260 
agtatcccaa atcacacctg tcgtgacgtg 1320 
gggatcaaag ctatcgctga gtataaggtc 13 80 
cattatgttg ctcctaagat gatggcgcgt 1440 
gatgatttgg atgacaccaa gtatcaagat 1500 
aaattaccga atgctcctga gatggaaatc 1560 
gaacgagcat acgtatacaa gcttaaccag 1620 
ttcacttctg ctcacgagga ggacgaagat 1680 
gatggggatg aaacagtacc cgtcctaagt 1740 
ggcaagacaa gattcaaccc ttccggaatc 1800 
ccgccggcta acctgttgga agggcgcggg 1860 
ggaaactttg ctttgatcga agatatcatg 1920 
ataggacatg accaggtcca ctctggcata 1980 



tttgaatggt cggagcgtat tgacctgaag ctg 



2013 



<210> 74 

<211> 671 

<212> PRT 

<213> Arabidopsis 



thaliana 



<400> 74 

Met Pro Leu lie His Arg Lys Lys Pro Thr Glu Lys Pro Ser Thr Pro 
15 10 15 

Pro Ser Glu Glu Val Val His Asp Glu Asp Ser Gin Lys Lys Pro His 
20 25 30 



Glu Ser Ser Lys 
35 

Ser Cys lie Asp 
50 

Trp Trp Phe Leu 
65 

Gin Tyr Val Thr 



Val Lys Leu Lys 
100 

lie Pro Gly He 
115 

Cys Ala Asp Gly 
130 

Glu Val Tyr Lys 
145 

Asn Glu Thr Gly 



Gly Leu Val Ala 
180 



Ser His His Lys 
40 

Ser Cys Cys Trp 
55 

Leu Phe Leu Tyr 
70 

Glu Arg He Thr 
85 

Lys Glu Gly Leu 

Val Thr Gly Gly 
120 

Leu Phe Arg Lys 
135 

Arg Pro Leu Cys 
150 

Leu Asp Pro Ala 
165 

Ala Asp Tyr Phe 



Lys Ser Asn Gly 



Phe He Gly Cys 
60 

Asn Ala Met Pro 
75 

Gly Pro Leu Pro 
90 

Lys Ala Lys His 
105 

Leu Glu Leu Trp 



Arg Leu Trp Gly 
140 

Trp Val Glu His 
155 

Gly He Arg Val 
170 

Ala Pro Gly Tyr 
185 



Gly Gly Lys Trp 
45 

Val Cys Val Thr 



Ala Ser Phe Pro 
80 

Asp Pro Pro Gly 
95 

Pro Val Val Phe 
110 

Glu Gly Lys Gin 
125 

Gly Thr Phe Gly 



Met Ser Leu Asp 
160 

Arg Ala Val Ser 
175 

Phe Val Trp Ala 
190 



Val Leu He Ala Asn Leu Ala His He Gly Tyr Glu Glu Lys Asn Met 
195 200 205 



Tyr Met Ala Ala Tyr 
210 

Arg Asp Gin Thr Leu 
225 

Ser Thr Asn Gly Gly 
245 

Val Leu Tyr Phe Leu 
260 

Leu Gly Gly Gly Gly 
275 

Val Met Asn lie Gly 
290 

Gly Leu Phe Ser Ala 
305 

Ala Pro Gly Phe Leu 
325 

His Val Met Arg Met 
340 

Pro Lys Gly Gly Asp 
355 

Lys Gly His Thr Cys 
370 

Gly Glu Ala Gly Glu 
385 

Gly Arg Met lie Ser 
405 

Glu He Asn Asn He 
420 

Pro Asn His Thr Cys 
435 

He Ala Gly He Lys 
450 



Asp Trp Arg Leu Ser Phe 
215 

Ser Arg Met Lys Ser Asn 
230 235 

Lys Lys Ala Val He Val 
250 

His Phe Met Lys Trp Val 
265 

Gly Pro Asp Trp Cys Ala 
280 

Gly Pro Phe Leu Gly Val 
295 

Glu Ala Lys Asp Val Ala 
310 315 

Asp Thr Asp He Phe Arg 
330 

Thr Arg Thr Trp Asp Ser 
345 

Thr He Trp Gly Gly Leu 
360 

Cys Gly Lys Lys Gin Lys 
375 

Asn Gly Val Ser Lys Lys 
390 395 

Phe Gly Lys Glu Val Ala 
410 

Asp Phe Arg Gly Ala Val 
425 

Arg Asp Val Trp Thr Glu 
440 

Ala He Ala Glu Tyr Lys 
455 



Gin Asn Thr Glu Val 
220 

He Glu Leu Met Val 
240 

Pro His Ser Met Gly 
255 

Glu Ala Pro Ala Pro 
270 

Lys Tyr He Lys Ala 
285 

Pro Lys Ala Val Ala 
300 

Val Ala Arg Ala He 
320 

Leu Gin Thr Leu Gin 
335 

Thr Met Ser Met Leu 
350 

Asp Trp Ser Pro Glu 
365 

Asn Asn Glu Thr Cys 
380 

Ser Pro Val Asn Tyr 
400 

Glu Ala Ala Pro Ser 
415 

Lys Gly Gin Ser He 
430 

Tyr His Asp Met Gly 
445 

Val Tyr Thr Ala Gly 
460 



Glu Ala lie Asp Leu Leu His Tyr Val Ala Pro Lys Met Met Ala Arg 
465 470 475 480 



Gly Ala Ala His Phe Ser Tyr Gly lie Ala Asp Asp Leu Asp Asp Thr 
485 490 495 

Lys Tyr Gin Asp Pro Lys Tyr Trp Ser Asn Pro Leu Glu Thr Lys Leu 
500 505 510 

Pro Asn Ala Pro Glu Met Glu lie Tyr Ser Leu Tyr Gly Val Gly lie 
515 520 525 

Pro Thr Glu Arg Ala Tyr Val Tyr Lys Leu Asn Gin Ser Pro Asp Ser 
530 535 540 

Cys lie Pro Phe Gin lie Phe Thr Ser Ala His Glu Glu Asp Glu Asp 
545 550 555 560 

Ser Cys Leu Lys Ala Gly Val Tyr Asn Val Asp Gly Asp Glu Thr Val 
565 570 575 

Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Ala Trp Arg Gly Lys 
580 585 590 

Thr Arg Phe Asn Pro Ser Gly lie Lys Thr Tyr lie Arg Glu Tyr Asn 
595 600 605 

His Ser Pro Pro Ala Asn Leu Leu Glu Gly Arg Gly Thr Gin Ser Gly 
610 615 620 

Ala His Val Asp lie Met Gly Asn Phe Ala Leu lie Glu Asp lie Met 
625 630 635 640 

Arg Val Ala Ala Gly Gly Asn Gly Ser Asp lie Gly His Asp Gin Val 
645 650 655 

His Ser Gly lie Phe Glu Trp Ser Glu Arg lie Asp Leu Lys Leu 
660 665 670 



<210> 75 
<211> 1986 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 75 

atgggcacac tgtttcgaag aaatgtccag aaccaaaaga gtgattctga tgaaaacaat 60 
aaagggggtt ctgttcataa caagcgagag agcagaaacc acattcatca tcaacaggga 12 0 



ttaggccata agagaagaag gggtattagt 
gatttcgaca ggaaaagaga cgggaacggt 
attttcattc ttggtgcatt cttaggtgta 
gttcataata gcgatagcga cttgtttgac 
tatttggatg attggaaaga tgttctccca 
caggctggta actactccac atcttcttta 
aaacaactct tacgtgatta taatatcgag 
gtcatttcta cgggaattga aagctgggga 
gcgcattttc gtaaacggct gtggggaagt 
aaagtttgtt ggttgaaaca tgtaatgtta 
tttacgctac gtgcagcaca gggcttcgaa 
atttggaaca aagttttcca aaatctggga 
agtgctgcgt atgattggag gcttgcatat 
acgaagctaa aggaacaaat cgaactgttt 
attggacatt ctatgggttc tcagattatc 
ggccctcttt acggtaatgg tggtcgtggc 
aatgcagcag ggacgcttct gggcgctcca 
atgaaagata ccattcaatt aaatacgtta 
agaattgaga gagtaaaaat gttacaaacg 
ggagaagagg tcatttgggg ggatatgaag 
actgacacat acggcaattt cattcgattt 
aatttgacaa tgaaagacgc cattaacatg 
agaagagtac atgagcagta ctcgttcggc 
aatgagctac accacaagca ctggtcgaat 
cacatgaaaa tctattgtat atacggggtg 
aaggaagagg atgactcctc tgctctgaat 
gtattcctca ccgaggggga cggaaccgtt 
tgggcccagg gtgcttcacc gtacaaccct 
aaacaccagc cagatcgatt tgatatacgt 
atcctcggca gcgcggagtt gaacgattac 
ctcgtcgagc cacgccaatt gtctaatttg 
atgtaa 



ggcagtgcaa aaagaaatga gcgtggcaaa 180 
agaaaacgtt ggagagattc cagaagactg 240 
cttttgccgt ttagctttgg cgcttatcat 300 
aactttgtaa attttgattc acttaaagtg 3 60 
caaggtataa gttcgtttat tgatgatatt 42 0 
gatgatctca gtgaaaattt tgccgttggt 480 
gccaaacatc ctgttgtaat ggttcctggt 540 
gttattggag acgatgagtg cgatagttct 600 
ttttacatgc tgagaacaat ggttatggat 660 
gatcctgaaa caggtctgga cccaccgaac 72 0 
tcaactgatt atttcatcgc agggtattgg 780 
gtaattggct atgaacccaa taaaatgacg 84 0 
ttagatctag aaagacgcga taggtacttt 90 0 
catcaattga gtggtgaaaa agtttgttta 960 
ttttacttta tgaaatgggt cgaggctgaa 1020 
tgggttaacg aacacataga ttcattcatt 10 80 
aaggcagttc cagctctaat tagtggtgaa 1140 
gccatgtatg gtttggaaaa gttcttctca 12 00 
tggggtggta taccatcaat gctaccaaag 12 60 
tcatcttcag aggatgcatt gaataacaac 1320 
gaaaggaata cgagcgatgc tttcaacaaa 13 80 
acattatcga tatcacctga atggctccaa 1440 
tattccaaga atgaagaaga gttaagaaaa 1500 
ccaatggaag taccacttcc agaagctccc 1560 
aacaacccaa ctgaaagggc atatgtatat 1620 
ttgaccatcg actacgaaag caagcaacct 1680 
ccgctcgtgg cgcattcaat gtgtcacaaa 1740 
gccggaatta acgttactat tgtggaaatg 1800 
ggtggagcaa aaagcgccga acacgtagac 1860 
atcttgaaaa ttgcaagcgg taatggcgat 1920 
agccagtggg tttctcagat gcccttccca 1980 
1986 



<210> 76 
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 76 

Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
15 10 15 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 30 



Asn His lie His His Gin Gin Gly Leu Gly His Lys Arg Arg Arg Gly 
35 40 45 



lie Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 
50 55 60 



Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 



lie Phe lie Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 



Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 110 

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 125 

Leu Pro Gin Gly lie Ser Ser Phe lie Asp Asp lie Gin Ala Gly Asn 
130 135 140 

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

Lys Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 
165 170 175 

Met Val Pro Gly Val He Ser Thr Gly He Glu Ser Trp Gly Val He 
180 185 190 

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe He 
245 250 255 

Ala Gly Tyr Trp He Trp Asn Lys Val Phe Gin Asn Leu Gly Val He 
260 265 270 

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 



Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 



Glu Gin lie Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 



lie Gly His Ser Met Gly Ser Gin lie lie Phe Tyr Phe Met Lys Trp 
325 330 335 

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 345 350 

Asn Glu His lie Asp Ser Phe lie Asn Ala Ala Gly Thr Leu Leu Gly 
355 360 365 

Ala Pro Lys Ala Val Pro Ala Leu lie Ser Gly Glu Met Lys Asp Thr 
370 375 380 

lie Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
385 390 395 400 

Arg lie Glu Arg Val Lys Met Leu Gin Thr Trp Gly Gly lie Pro Ser 
405 410 415 

Met Leu Pro Lys Gly Glu Glu Val lie Trp Gly Asp Met Lys Ser Ser 
420 425 430 

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe lie 
435 440 445 

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 
450 455 460 

Lys Asp Ala lie Asn Met Thr Leu Ser lie Ser Pro Glu Trp Leu Gin 
465 470 475 480 

Arg Arg Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 490 495 

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 505 510 

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys lie Tyr Cys lie Tyr 
515 520 525 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 
530 535 540 



Asp Ser Ser Ala Leu Asn Leu Thr lie Asp Tyr Glu Ser Lys Gin Pro 
545 550 555 560 



Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 



Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 590 

lie Asn Val Thr lie Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 
595 600 605 

lie Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp lie Leu Gly Ser 
610 615 620 

Ala Glu Leu Asn Asp Tyr lie Leu Lys lie Ala Ser Gly Asn Gly Asp 
625 630 635 640 

Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 
645 650 655 

Met Pro Phe Pro Met 
660 



<210> 77 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 77 

ggatccgcgg ccgcacaatg ccccttattc atcgg 35 

<210> 78 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 



<400> 78 

ggatcccctg caggtcacag cttcaggtca atacg 



35 



<210> 79 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 

<400> 79 

ggatccgcgg ccgcacaatg ggcacactct ttcgaag 3 7 

<210> 80 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: Synthetic 
oligonucleotide primer 



<400> 80 

ggatcccctg caggttacat tgggcacact gtttcgaag 



39 



